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The purpose of this paper is to point out a parallelism between certain aspects 
of the arithmetical theory of fields, as developed chiefly by Krull (1), and a 
new approach to the theory of function spaces recently developed by Kanto- 
rovitch (2). The unifying concept is that of a partially ordered abelian group. 
Theorems 1 and 2 below are concerned with the representation of such groups 
by means of vectors with components lying in linearly ordered abelian groups, 
while Theorem 3 deals with the completion of such groups. While writing this 
paper, I had the privilege of seeing the proofsheets of a beautiful paper by 
Lorenzen (3) on semigroups, which not only contains my results but goes con- 
siderably further. Proofs of the theorems will therefore be omitted, and I shall 
content myself with their interpretation as theorems on partially ordered groups 
in the additive notation. The procedure used in proving Theorem 3, for 
example, which is essentially the ‘‘v-Gruppensatz” of Artin, van der Waerden, 
and Krull (Krull, l.c., p. 120), is thus seen to be a direct generalization of the 
process of completing the additive group of rational numbers by means of 
Dedekind cuts. 


In the classical theory of ideals in a finite algebraic field K, one succeeds in 
decomposing every principal ideal (a2) + (0) of K uniquely into a product of 
prime ideals: 

(a) = pe? pr”. 
Thus (a) can be represented by a vector {a,, az,---,an, 0, 0,---} the 
components a; of which are rational integers, all but a finite number of which 
are zero. The i** component a; of the vector is simply the power to which 


the 7“ prime ideal p; occurs in the decomposition of (a). This vector repre- 
sentation has the following properties. If 


(a) {a , a2, }, (b) — Be, }, 
(a+b) }, 


then 1) (ab) + 61, a2 + Be, +++ }, 
2) a divides b if and only if a; S Bi (i = 1, 2, --- ), 
and 3) y; = min (a;, B;). 
In Krull’s general (exponential) valuation theory, one simply removes the 
finiteness restriction and at the same time allows the exponents a; to be arbitrary 
teal numbers, or even to be elements of non-Archimedean linearly ordered 
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abelian groups (definition below). One then refers to the classical case as 
“finite and discrete.” Since there is no reason to assume that the vector- 
components are denumerable or ordered in any way, Krull uses the notation 
Now the non-zero elements of K form a group G under multiplication, and a 
great many of the important concepts of multiplicative arithmetic can be 
formulated in G, the operation of addition not being used. On the one hand 
this leads to a new and more general theory—the arithmetic of abelian groups— 
and on the other hand (because of the simplification) it may lead back to new 
results in K. The latter assertion is borne out by Lorenzen’s paper. One 
must of course assume the existence of a set S of “integers” in G such that 
1) the product of two integers is an integer, 2) the identity element of G is an 
i integer, and 3) every element of G is the quotient of two integers. We then 
ih | define division in the usual way: a divides b if b/a isin S. The principal ideal 
(a) generated by a is defined to be the set of all elements of G divisible by a. 
i We may also begin with such a set S (called a semigroup), namely a set in 
va ' { which a multiplication ab is defined satisfying the associative and commutative 
| bi laws, such that an identity element 1 exists in S, and such that the cancellation 
law (4) holds: ac = be implies a = b. S may then be extended to a group G 
in the usual way by means of formal quotients a/b. G is called the quotient- 
| group of S. 
He G (or S) is said to be reduced if two elements which divide each other are 
necessarily equal. If G is not already reduced we can replace it by the group G 
of principal ideals in G, which has of course the same arithmetical structure as G 
itself. An element u of G is a unit if both u and uw are in S. Evidently G@ 
is isomorphic with the factor group G/U where U is the group of units of G. 

We may now proceed to consider possible representations of the principal 
ideals of G by means of vectors, as described above for a field K. Condition 3) 
has now no meaning as it stands, since addition is not defined in G, so I shall 
simply discard it. (Actually Lorenzen retains 3) by referring it to a given 
system of ideals in G: if c is in the r-ideal (a, b), then y; = min (a,, B;). For 
r = 8 this is automatically satisfied. Theorem 2 below gives a representation 
satisfying this condition for r = ¢.) 

A vector representation of G can plainly be regarded as an embedding of G 
in the group T of all vectors of the type considered, such that the division 
relation in G is identical with the component-wise order relation in IT. Since T 
is an additive group, it seems natural to write G additively to begin with. Let 

_ therefore G be a reduced group with integral part S; write it additively, and 
ob fa | write a < b if a divides b relative to S. One easily verifies the following, 
Bi | | which I take as the defining axioms for a partially ordered abelian group: 

I. Gis an abelian group under +. . 
ERO II. G ts a directed partially ordered set under S, that is: 
4 1) a a, 
2) a5 b,b S aimplya = b, 
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3)a<b,b Scimplya Se, 

4) given a and b, there existe ¢ in G such thala Scbse«. 
IU. a bimpliesa +e b+ forallcinG. 

Axioms II 1, 2, 3) are the usual axioms for a partially ordered set; the theory 
of such sets has become quite popular recently (see e.g. Mac Neille (5)). As 
usual, one writes a 2 bforb S a,anda <borb>aforasbaxb. IfA 
is a subset of a partially ordered set P, an element b of P is an upper bound 
of A if a < b for all a in A, and a least upper bound if b S c¢ for every upper 
bound cof A. P is called a directed set if II 4) holds (6); this condition simply 
requires that every pair of elements of P (and hence every finite subset of P) 
be bounded from above. Lower bound and greatest lower bound are defined 
dually. If every pair of elements of P (and hence every finite subset of P) 
has a least upper and greatest lower bound, P is called by Birkhoff (7) a lattice. 
If, moreover, every subset of P bounded from above has a least upper bound, 
then P is a complete lattice; the dual property is then a consequence. P is 
linearly ordered if exactly one of the relations a < b, a = b, a > b holds for 
each pair of elements a, b of P. Such a set is evidently a lattice, but need not 
be complete. 

Let now G be a partially ordered abelian group. One easily verifies the usual 
rules for manipulating + and <, e.g. 


Sa,b S be imply Sat+h, 
a<b implies a+c<b+ce (alle). 


An element a = 0 is called positive; the set S of all positive elements of G is a 
semigroup. Evidently a < b if and only if b — ais in S, and hence the given 
order relation < is the same as division relative to S. By II 4) every element a 
of G is the difference of two positive elements. For if c be chosen such that 
asc,0 Sc then 


-a=c—(c— a), c2 0, c—a20. 


Hence G is the difference-group of S. Finally, by II 2), G is reduced. We 
thus see that the notion of a partially ordered abelian group is identical with 
that of the quotient-group of a reduced semigroup. 

We may now specialize the type of the order relation in G as indicated above 
for general partially ordered sets. Thus a linearly ordered abelian group is one 
in which the order relation is linear; a lattice-ordered (complete ordered) group 
ls one which is a lattice (complete lattice) under the order relation. Before 
proceeding to a discussion of these types, and the consideration of more general 
types, a few preliminary remarks are necessary. 

Observe that a < b if and only if —a => —b. Hence the mapping x > —z 
is not only a group automorphism of G, but also a “dual automorphism”’ of G 
regarded as an ordered set. There is thus a “principle of duality” in G. For 
example, the dual of II 4) is automatically established. Likewise, if every 
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pair of elements a, b of G has a least upper bound, denoted by sup (a, b), then 
every pair a, b has also a greatest lower bound given by 


inf (a, b) = —sup (—a, —b). 


Two partially ordered groups G and G’ will be called order-isomorphic if there 
is a group isomorphism x <> x’ between them which preserves order: if a < a’, 
b <= b’, then a S bif and only if a’ S b’. 

If G’ is a subgroup of a partially ordered group G, then the order relation in G 
obviously orders G’, and all the axioms are satisfied except possibly II 4). If 
II 4) holds for G’, I shall call G’ an ordered subgroup of G. This is equivalent 
to requiring that every element of G’ shall be the difference of two positive ele- 
ments of G’. On the other hand, a given partially ordered group H will be said 
to be order-embedded in G if it is order-isomorphic to a subgroup G’ of G.  @’ is 
then naturally an ordered subgroup of G. 


The prime example of a linearly ordered abelian group is the additive group P 
of real numbers, with < having its usual significance. P and all of its sub- 
groups satisfy the Archimedean axiom: 

If a > 0, B > O then there exists a natural number n such that na > 8B. 

Conversely, every linearly ordered abelian group satisfying this axiom is 
order-isomorphic with a subgroup of P. As an example of a non-Archimedean 
linearly ordered group, let I be the additive group of all real n-dimensional 
vectors {a,,---,@n}. We order I lexicographically, i.e. 


+++, an} > {Bi , » Bn} 


if the first non-vanishing difference a; — 6, , a2 — Be, +--+ is positive. One 
easily verifies that T is a linearly ordered group under this definition. If 
€.,---, €, are the unit vectors and 7 > j, then ne; < e€; for all nm. This con- 
struction can be extended to infinite vectors, and Hahn (8) has shown that 
every linearly ordered group is a subgroup of such a group. Since this more or 
less disposes of linearly ordered groups, we turn our attention to the construction 
of non-linear groups from linear ones. 

The most obvious procedure is to form the (unrestricted) direct sum I of a 
set of linearly ordered groupsT,. IT may be thought of as the set of al! possible 
“vectors” a = { ---,a,,--+-+ } with one component provided for each I’, , and 
a, ranging independently over T,. Two vectors are added in the usual way: 


We now order I’ component-wise: 


{---,a-,---} S{---,B,-++} ifa, S B, forall r. 


One readily verifies that the axioms are satisfied. Moreover, I will be non- 
linear if there is more than one T,. We shall call I a vector-group. Greek 
letters will be used for vector-groups and their elements. 
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Actually is lattice-ordered. If 
then sup (a, 8) = { , max (a,, B;), +++ }, 
inf (a, 8) = { ---,min (a,, B,), «+= }. 


It is customary to write max and min instead of sup and inf in a linearly ordered 
group, since max (a, 8) is simply a or 8, whichever is larger. 

If each I’, is Archimedean, we call T an Archimedean vector-group. In this 
case the components a, of the vector a can be regarded as real numbers, and 
if we denote by 7 the index class over which the descriptive index 7 ranges, 
then a may be regarded as a single-valued function on 7 to the real numbers. 
On the other hand we may start with any set 7’ and define IT to be the set of 
all real functions a = a(r) on T. Addition and order in T may be defined in 
the obvious way: 


y = a+ Bis the function y(r) = a(r) + B(r), 
a S Bifa(r) S B(r), all rinT. 


Every Archimedean vector-group is an ordered subgroup of such a group I. 
In Hahn’s construction mentioned above, one orders the set 7’ in any way and 
forms the additive group of all functions a such that the set of all 7 for which 
a(r) ¥ 0-is well-ordered under the ordering in 7. Such functions can then be 
ordered lexicographically. 

The problem with which we originally began can now be expressed precisely 
as the order-embedding of a given partially ordered group G in a vector-group I’. 
Our first theorem characterizes those groups G for which such an embedding 
is possible. 

THEOREM 1. A partially ordered abelian group G can be order-embedded in a 
vector-group if and only if it satisfies the condition: if some natural multiple na of 
an element a of G is positive, then a itself is positive. 

This is Lorenzen’s Satz 14 for the special case r = s. The necessity of the 
condition is obvious. The sufficiency can also be proved, without the use of 
ideals, by paralleling Krull’s proof (1.c. pp. 110-111) that a domain of integrity 
isa “principal order” if and only if it is integrally closed. I had known Theorem 
1 only as an analog of Krull’s theorem, whereas Lorenzen’s beautiful result 
contains both as the special cases r = sandr = d. However proved, it requires 
transfinite induction. 

From the standpoint of a logical development of the theory and construction 
of partially ordered groups, Theorem 1 characterizes the immense class of all 
possible ordered subgroups of all possible vector-groups. 

We pass now to lattice-ordered groups. Such a group, in the multiplicative 
notation, is simply one in which every pair of elements has a greatest common 
divisor. These were first studied in full generality by Dedekind (9), and some 
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years later by Levi (10). The fundamental properties of such groups have 
been derived anew in the additive notation by Kantorovitch (lc. §1) and 
Freudenthal (11), apparently without realizing the parallel to elementary arith- 
metic. For example, their decomposition 


a= a,— a, = sup (0, a), a_ = sup (0, 


is simply the expression of a fraction ‘in lowest terms.’’ As a new contribu- 
tion, they show that | a | defined by 


=a, +a_ 


has the usual properties of an absolute value. These writers are, however, 
primarily interested in the case when G is a real linear space, i.e. admits the 
real field as multiplier domain with the usual rules for multiplying vectors by 
scalars. This is always the case in function spaces of any interest. 

From the point of view of general lattice theory, a lattice-ordered group is 
rather special, since it is distributive and self-dual. The former was known to 
Dedekind, but Freudenthal’s proof (l.c., p. 642) is much simpler. The self- 
duality has been mentioned above. 

One readily verifies that a lattice-ordered group G satisfies the condition of 
Theorem 1, and hence can be order-embedded in a vector-group Tr. Now I 
is a lattice and so is G, and the order relation in G is that of [. But it does not 
follow necessarily that G is a sublattice of T in the usual sense, i.e. if a and b 
are in G then sup (a, b) relative to ! may not be the same as sup (a, b) relative 
to G. The next theorem takes care of this situation. 

THEOREM 2. Any lattice-ordered abelian group G can be order-embedded in a 
vector-group T' such that G is a sublattice of T. 

This is Lorenzen’s Satz 11 (along with Satz 4). In order to construct such 
aT’ one must introduce what Lorenzen calls ¢-ideals in G, this notion due origi- 
nally to Arnold (12). These ideals can be defined in any G, but if @ is lattice- 
ordered a t-ideal can be defined as a subset a of G bounded from below and such 
that 

1) ofaisinaanda b, then b is ina; 

2) if a and b are ina, then inf (a, b) isina. Except for the boundedness con- 
dition, this (or its dual) is Stone’s definition of an ideal in any distributive 
lattice (13). 

(As a rather interesting minor theorem not mentioned by Lorenzen, it is 
possible to set up a one-to-one correspondence between the prime subideals of a 
given prime p and the prime ideals of the corresponding valuation semigroup 
S, under which the inclusion relation is invariant. As a consequence, which 
can also be proved directly, if p and q are any two prime ideals then either 
they are relatively prime or one contains the other. Cf. Krull, l.c., p. 112.) 

We come now to the consideration of complete ordered groups. (In Lo- 
renzen’s terminology, a lattice-ordered group is called ‘‘v-complete,” or simply 

“complete,” while a complete ordered group is called “totally complete.” His 
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terminology is flawless, but I prefer to reserve the term “complete” for its 
well-established topological meaning. Kantorovitch calls it a “partially ordered 
topological group.”) For this theory I can only refer to §3 of Kantorovitch’s 

r. Here a notion of convergence is introduced; the functions x + y, 
sup (x, y), inf (x, y), and |x| are shown to be continuous, and the Cauchy 
convergence criterion is proved. I do not believe that the resulting topology 
has ever been used in arithmetic. 

(I might remark here that the results in Kantorovitch §2 hold without the 
assumption that G admits the field of real numbers, and that Satz 21f in §4 
holds in any lattice-ordered group. One simply uses standard arithmetical 
arguments, e.g. that if inf (a, b) = 0 then inf (Aa, ub) = 0 for arbitrary positive 
integers A, yu.) 

Our final theorem deals with the question of when a given partially ordered 
group can be completed. 

TuEeorEM 3. A partially ordered abelian group G can be order-embedded in a 
complete ordered group H if and only if G satisfies the condition: if the set of all 
natural multiples na of an element a of G is bounded from below, then a is positive. 

Moreover, H is essentially unique if we require that every element of H be the 
greatest lower bound of a set of elements of G. 

The condition stated is what Krull calls “complete integral closure,’ and 
Lorenzen “total closure,’’ and the sufficiency is an immediate consequence of 
the Artin-van der Waerden-Krull ‘“v-Gruppensatz”’ (Krull, l.c., p. 120). The 
notion of a v-ideal can be introduced in any partially ordered set P: a subset 
a of P bounded from below is a v-ideal if it contains every upper bound of the 
set of all lower bounds of a. The set of all v-ideals in P constitutes a complete 
lattice L under set-theoretical inclusion. P is order-embedded in L by identi- 
fying the element a of P with the principal ideal (a) it generates. Moreover, P 
is normally embedded in L, by which I mean that every element of L is the 
greatest lower bound of a subset of P. This means of completing a partially 
ordered set was used by Mac Neille (l.c., p. 443); indeed, a v-ideal is the same 
thing as the upper class of a cut. Mac Neille’s definition of a cut is a direct 
generalization of that of a Dedekind cut in the rational numbers. 

If Gis a partially ordered group, then we can add two cuts or v-ideals a, 6 in 
exactly the same way that we add two Dedekind cuts: a + 6 is defined to be the 
smallest v-ideal containing all the sums a + b with ain a, bin 6. Addition so 
defined coincides with that in G, since for principal ideals we have (a) + (b) = 
(a + b), and satisfies all the postulates for an abelian group except the existence 
of negatives. In fact, the set G of all v-ideals in G satisfies all our postulates for a 
complete ordered group except for this one lack. But the v-Gruppensatz states 
that G will be a group if and only if G is completely integrally closed. 

The necessity of this condition is almost immediate. It holds for any com- 
plete ordered group (see e.g. Kantorovitch, Satz 15f), and hence for any ordered 
subgroup thereof, 

’ The second part of the theorem is an obvious extension of the unicity theorem 
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of Arnold (l.c.) in the finite discrete case. Because of the normality condition 
one is able to set up a one-to-one correspondence a «> a between H and the group 
of v-ideals in G such that 1) a is the set of all upper bounds of a in G, and 2) 
a is the greatest lower bound of ain H. This correspondence is an order-iso- 
morphism between H and © leaving the elements of G invariant. This unicity 
theorem does not hold for arbitrary partially ordered sets. 

The v-system plays the decisive réle in the completion process because it is 
the only system of ideals which can possibly form a group. It is the only system 
which can do the job, does it well if it is possible at all, and then expires nobly— 
for in a complete group every v-ideal is principal and hence of no further use. 
For further study one must use a weaker system, preferably the (-system, or 
avail oneself of the Kantorovitch topology. 

Regarding the condition of ‘‘complete integral or ‘total closure” 
occurring in Theorem 3, it is easily seen that it is equivalent to the Archimedean 
axiom in the case of a linearly ordered group. (This shows incidentally that a 
non-Archimedean linear group cannot be completed, at least in the sense here 
used.) . We might therefore adopt it as the defining property of an Archimedean 
partially ordered group. In fact, Kantorovitch refers to it as the “Archimedean 
axiom” in an earlier paper. 

One easily sees that an Archimedean vector-group, as previously defined, and 
hence any ordered subgroup thereof, is Archimedean in this sense. Whether or 
not the converse is true—that every Archimedean group can be order-embedded 
in an Archimedean vector-group—is still an open question. This is a conjecture 
of Krull; Lorenzen proves a special case (§5, l.c.). I have been working on this 
for the past year, for the most part using ¢-ideals, but with no success whatever. 
Perhaps the Kantorovitch topology may open up a clue to the mystery. 


Massacuusetts INstTITUTE OF TECHNOLOGY, 
CAMBRIDGE, Mass. 
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QUASI-GROUPS'! 


By G. N. GARRISON 
(Received June 3, 1939) 


Introduction 


Briefly stated, a quasi-group is a multiplicative system in which, given any 
two of the three elements a, b, c, the third is uniquely determined by the relation 
ab = c. Obviously, multiplication is not necessarily associative. For this 
reason the quasi-group may be used to study generalizations of the associative 
law. Such studies have been made by A. Suschkewitsch, B. A. Hausmann and 
O. Ore,” and, more recently, by D. C. Murdoch.’ 

A different point of view is taken in the present paper. No form of asso- 
ciative law is required for the entire quasi-group. Instead, we have endeavored 
to discover properties of subsets of its elements. 

Section 1 is devoted to fundamental definitions. It also contains all the 
formal properties of complexes which we shall require. Section 2 deals with 
subsets of elements which have special associativity properties. Some of its 
results have been stated previously without proof by E. Schénhardt* for quasi- 
groups which contain a two-sided identity element. In Section 3 we correlate 
the results of the previous section for the case of commutative multiplication. 

Finally, in Section 4 we discuss quotient quasi-groups and homomorphisms 
of a quasi-group. This theory depends upon our notion of an invariant complex 
H (which is not necessarily a sub-quasi-group) defined by relations of the form 


(Ha)(Hb) = He. 


We are able to show that every complex with this property determines a homo- 
morphism and conversely, that every homomorphism of a quasi-group is iso- 
morphic with the quotient quasi-group determined by a suitably chosen in- 
variant complex. 

I would like to acknowledge my appreciation of the privilege I have had of 
discussing the details of this paper with Professor J. H. M. Wedderburn. 


1 Presented to the American Mathematical Society, Dec. 26, 1939. 

1 A, Suschkewitsch, On a generalization of the associative law, Trans. of the Amer. Math. 
Soc., vol. 31 (1929), pp. 204-214. 

2B. A. Hausmann and O. Ore, Theory of Quasi-groups, Amer. Jour. of Math., vol. 59 
(1937), pp. 983-1004. 

*D. C. Murdoch, Quasi-groups which satisfy certain generalized associative laws, Amer. 
Jour. of Math., vol. 61 (1939), pp. 509-522. 

4E. Schénhardt, Uber lateinische Quadrate und Unionen, Jour. fiir Math., vol. 163 
(1930), p. 203. 
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1, The quasi-group 


1.1. A finite set S of n elements a; , a, ---,a, is called a quasi-group of 
order n when: 

i) S possesses the group property, that is, when each of the n” products a,a; 
designates one and only one element of S; and 

ii) qa; = aa; and a,a, = aja, only if ¢ = j for all k from 1, 2, --- , n. 

We have at once the following two theorems: 

THEOREM 1.1. Condition (ii) implies that there exists in S a unique x satis- 
fying ax = b and a unique y satisfying ya = b for each pair of values of a, b in S. 

For by (ii) the n elements aa; (7 = 1, 2, --- , m) are all distinct. Thus one 
of them is the element 6. Let aa, denote it. Then x = a, is the required 
solution. The other part of the theorem follows similarly. 

THEOREM 1.2. A sufficient condition that a quasi-group be a group. is that 
a;(a jax) Ke (a;0;)ax (2, j; k= 1, 2, n). 

For, condition (i), the result of Theorem 1.1, and the hypothesis of the present 
theorem, form the definition of a group used by some writers. It has been 
shown to be equivalent to the more usual one in terms of an identity element 
and inverses.” 


1.2. Any sub-set H of the elements of a quasi-group S will be called a complex 
H in S; a complex that does not contain all the elements of S is called a proper 
complex. 

The complex composed of those elements common to two or more complexes 
H, K, M, --- will be denoted by H A K A M a .-.-- and will be called their 
intersection. If H A K be a null-set we write H A K = 0. To denote the 
complex consisting of those elements which are contained in at least one of the 
complexes we shall write H + K + M + ...- and shall call it the sum of H, 

By the product HK of two complexes H and K we shall mean that complex 
which contains all possible distinct products of the form hk where h and k 
denote elements of H and K respectively. And by H = K we mean that H 
and K consist of exactly the same set of elements of S. 


1.3. The number of distinct elements contained in a given complex is its order. 
We now list a number of simple theorems to which we shall have frequent occa- 
sion to refer. 

Let H, K, M, and T denote complexes in a quasi-group S. Then we have: 

i) If H be of order h, so are also Hs and sH for any element sin S. If also K 
be of order k, then HK (or KH ) is of order at least as great as the larger of h and k. 

ii) A K) = sH 8K and (H K)s = Hs Ks. 

For if a be any element of both H and K, then sa is an element of both sH 


* Hasse, H., Hoehere Algebra, p. 52. 
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and sK. Hence 
(1) s(H A K) S sH AQ sk. 


Also if b be any element of both sH and sK then b = sh = sk for some pair 
h<H,k < K. By §1.1 (ii) h = kandb < s\H A K). Thus 


(2) sH A sK S s(H A K). 


We compare (1) with (2) and have the theorem. 

iii) If Hs = Ks or sH = sK for some s, then H = K. 

For if h be an element of H, then for some k < K we have hs = ks. Thus 
h = k (§1.1ii) and H < K. Similarly K = H. Hence the theorem. 

iv) If H, K, and M all three have the same order, and if HK = M, then also 
Hk = M and hK = M for all k in K and h in H. 

For we have Hk the same order as H (cf. (i) above) and thus the same order 
as M. But all elements of Hk lie in HK and thus in M, so that Hk = M. 

v) If H, K, M, and T all have the same order, and if HT = M = KT or TH = 
M = TK, then H = K. 

For if t < T, Ht = M = Kt by (iv) and H = K by (iii). 


1.4. By a sub-quasi-group H we mean a complex H in S whose elements form 
according to §1.1 a quasi-group. We make three comments which will be 
useful later. 

i) A complex H is a sub-quasi-group of S if and only if HH = H. The condi- 
tion is necessary to satisfy §1.1 (i); and it is sufficient since it satisfies §1.1 (i) 
and since h,h; = hh; for i # 7, being impossible in S, is also in H, so that 
§1.1 (ii) is satisfied. 

ii) The order of H need not divide the order of 8. 

This is seen from the example below where H = (a, b) is a sub-quasi-group 
of order 2. 
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ili) The intersection of two sub-quasi-groups is either a sub-quasi-group or 1s zero. 

For the example in (ii) shows that the intersection may be zero, for there 
(a,b) A (e) = 0. IfnowH A K = T # O, then #’ lies in both K and H, 
and thus in their intersection 7', so that TT = T. 


2. Center, left, and right associative elements 


2.1. We shall call an element c of S center associative in S if x(cy) = (xe)y 
for all z, y. We then have the following 
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THEOREM 2.1. The complex C consisting of the totality of center associative 
clements of a given quasi-group S is a group. Its identity element e is both a left 
and a right identity element for S itself. Furthermore, if for c in C, s in S, we 
have cy = 8, and we = 8, then y = c's, and w = sc. Finally, the order of C 
divides the order of S. 

The proof will be made by establishing five lemmas. 

Lemma 1. The complex C is a sub-quasi-group, that is, CC = C. 

To show this, let c, d be a pair of elements in C. We shall prove that cd is a 
center associative element which shows that it lies in C since C is the totality of 
such elements. We have 


(3) (xe)d = x(cd), e(dy) = (cd)y 


for all x, y since c and d are center associative. Since d is center associative we 
have from (3) 


(4) (xe)(dy) = ((xe)d)y = (x(ed))y. 
Also since ¢ is center associative we see that 
(5) (xe)(dy) = x(c(dy)) = x((ed)y). 


We now compare (4) with (5) and find that ed is a center associative element. 
This proves the lemma. 

Lemma 2. The sub-quasi-group C is associative, and is thus by Theorem 1.2 
a group. 

For, since x(cy) = (xc)y for all x, y in S, we have for all c, d, f in C the relation: 
d(cf) = (de)f. Hence Theorem 1.2 applies. 

LemMMA 3. The identity element e of C is both a left and a right identity element 
for 8S. 

Proor. Let ¢ be an element of C, and let x vary over S. Then cx varies 
over S by §1.1 (ii). Since e(cx) = (ec)x = cx for all x we see that e¢ is a left 
identity in S. Similarly, e is a right identity. 

Lemma 4. The equations cy = s, we = s forc in C and sin S have the unique 
solutions y = c's, w = sc’ where c' denotes the inverse of c in C. 

For unique solutions exist by Theorem 1.1. We verify the above expressions 
for them by substitution; thus 


cy = c(c's) = (cc ')s = es = 8 


since ¢', being in C, is center associative and since Lemma 3 applies here. 
The other solution is checked similarly. 

Lemma 5. The order k of C divides the order n of S. 

Proor. We shall show that S may be represented as a sum of non-inter- 
secting complexes Cs; , each of order k, in such a way that each element of S 
lies in one, and only one, such complex. 

Let s; be any element of S not contained in C. Then Cs; contains k distinct 
elements of S. Also Cs; A C = 0, for if not we would have cs; = c’ for c, c’ 
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in C, from which follows s,; = ¢ ‘c’ by Lemma 4; but this contradicts the hy- 
pothesis that s; be not in C. 

Next let se be an element of S not contained in either C or Cs;. Then Cs, 
is of order k with Cs; A C = 0. We have also Cs, A Cs; = 0 since cs, = c's, 
implies s. = (c ‘c’)s; in view of the center associativity of c’. Thus we reach 
another contradiction, namely, s. an element of Cs; . 

Proceeding in this way we show that it is possible to select elements s, , 
8, +--+, 8 from S such that 


S = Cs, + Cse + Cs, 


with Cs; A Cs; = Ofori ¥j. This proves the lemma and completes the proof 
of Theorem 2.1. 
Coroutiary. We may also represent S as a sum of complexes of the form sC. 


2.2. Left and right associative elements. By a left associative element m we 
mean one which satisfies the relation m(xy) = (max)y for all xz, y; and by a 
right associative element r one which satisfies (xy)r = x(yr) for all 2, y. 

We now state a theorem similar to Theorem 2.1 which, though similar, still 
differs in enough ways to make a precise statement desirable. 

THEOREM 2.2. The complex L (or R) consisting of the totality of left (right) 
associative elements of a given quasi-group is a group. The identity element ¢ of 
L(R) is a left (right) identity of S itself. The equation mz = s, m in L, s in S, 
has the unique solution x = m™'s, while yr = 8, r in R, has the unique solution 
y = sr’. Also, the order of L (or R) divides the order of 8S. 

Proor. Much of this proof need not be written out in detail since obvious 
modifications of Lemmas 1-5 will follow here. Certainly nothing need be 
carried through for both L and R. We shall however give the detailed proof 
that LL = L. Thus, let m, n be a pair of elements in L, and z, y any pair in S. 
Then 


m(n(xy)) = (mn) (xy) 
(m(nx))y = ((mn)x)y 


from which we see that mn is left associative and is thus contained in L. Finally, 
in order to apply the methods of Lemma 5 to prove the last part of the theorem, 
we must be careful to form complexes Ls;(s;R) so that we have: 


S = Ls, + Ls, + --- + Ls; 


m((nx)y) 


or 
S = + + sR. 


We turn now to another interesting complex which is a group. 
THEOREM 2.3. Let M be a sub-quasi-group of S, such that any pair of elements 
m, m' in M satisfy the relation 


m(m’x) = (mm’)x 
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— Then M isagroup. Its identity e is a left identity of S. The equation 
mz = s(m < M,8 < S) is solved by x = m''s. Finally, its order divides the 
order of S. 

Proor. Since (mm’)m, = m(m’m) we have by Theorem 1.2 that M is a 
group. Let e denote its identity element. Then m(ey) = (me)y = my so that 
ey = y for all y. Thus e is a left identity in S. The relation m(m™'s) = 
(mm~!)s = es = 8 verifies m~'s as the solution z of the equation mz = s. The 
proof that the order of M divides the order of S is similar to the proof of 
Lemma 5. 

As examples of groups satisfying the conditions for M we mention the groups 
L and C already discussed. 

THEOREM 2.4. When any two of the four groups C, L, R, M exist in S their 
identity elements correspond. 

For if there were two left identities e and e’ we would have er = x = e’x 
which is impossible by §1.1 (ii). Similarly there is only one right identity /f. 
But when both of these exist they are the same since e = ef = f. 


2.3. The following theorems are of some interest in connection with the 
classification of quasi-groups. 

THEOREM 2.5. A necessary condition for the existence, in a quasi-group S, 
of L, (or M), R, or C is the existence of a left identity, a a identity, or both, 
respectively. 

THEOREM 2.6. A left identity e is left associative, a right identity is right 
associative, and an element which is both is left, right, and center associative. 

For if ec = a for all x, then (ex)y = xy = e(zy), so that e is left associative. 
The rest of the proof is similar. 

As a final remark, we observe that the intersections of any two or more of 
C, L, R, M are groups. For their intersections are never vacuous (Theorem 
2.4). Hence by §1.4 (iii) they are sub-quasi-groups. And finally, by argu- 
ment similar to the proof of Lemma 2, they are groups. 


3. Commutativity and normal complexes 


3.1. An element ¢ will be called commutative in S when cx = xc for all z in S. 
If every element of S is commutative in S, then S will be called commutative. 
The results of Section 2, for S commutative, are related by the following 

THEOREM 3.1. In a commutative quasi-group a left associative element is right 
associative and conversely ; furthermore, such elements are also center associative ;° 
thatis, L = R SC. 

Proor. Let b be left associative. Then b(cy) = (bx)y, hence (ry)b = y(bz) 
and thus (yx)b = y(xb) which proves the first part of the theorem if the steps 
be read both as written and backwards. To prove b is also center associative 
(bx)y = (xb)y and compare with (bz)y = b(zy) = b(yx) = (by)x = 
x(by). 


* I have not been able to prove or to disprove the converse. 
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3.2. A complex H contained in S will be called normal in S if for all x we have 
Hx = cH. Itis clear that the following complexes are normal in S: 

i) every complex, if S be commutative; 

ii) any complex whose elements are commutative in S; 

iii) the complex S itself; 

iv) the intersection of two normal complexes if it is not a null-set. 


3.3. We now prove a theorem which demonstrates the existence of a type of 
complex which we shall study closely later. 

THeoreM 3.2. Let H be a sub-group of any two of the groups C, L, or R 
($§2.1-2.3) and let H be normal in S. Then for any pair of elements x, y in S 
we have (Hx)(Hy) = H(xy); and also, H being normal, (xH)(yH) = (ay)H. 

Proor. We give the detailed proof for the case H S (C A L). The others 
are all made similarly. Let small letters h; denote elements of H. Then 


(hoy) = hi(x(hey)) since is left associative; 
= h,((xhe)y) since he is center associative; 
= h,((h3sx)y) since H is normal; 
= h,(hs3(xy)) since hz is left associative; 
= (hyh3)(xy) since hy is left associative. 


This means that the product (Hx)(Hy) S H(azy) but by §1.3 (i) the inclusion 
sign is not possible. Hence the theorem follows. 


4. Invariant complexes and quotient quasi-groups 


4.1. A complex H is called invariant in S if, for all a, b in S there exists a 
corresponding element c such that (Ha)(Hb) = Hc; and (as we shall prove in 
Theorem 4.9 and below) an element d such that (aH)(bH) = dH. 

We observe at once that such complexes exist in view of Theorem 3.2 in 
which we showed the existence of a complex H with properties more special 
than our definition requires, for there He = H(ab) and a < Ha for all a were 
also true. We shall consider these cases later. 

Until after the proof of Theorem 4.9 we shall distinguish between the two 
cases of our definition by using the terms left and right invariant. We turn 
now to a study of the properties of a left invariant complex in an arbitrary 
quasi-group. 

THEorEM 4.1. If H be left invariant and if (Ha)(Hb) = (Ha)(Hc), then 
Hb = He; and if (Hb)(Ha) = (Hc)(Ha), then also Hb = He. 

For since Ha, Hb, He are all of the same order by §1.3 (i), the theorem 
fellows by §1.3 (v). 

THEOREM 4.2. Hach element b of S lies in some complex Hz. 

Proor. Form Hz; for all 2; in S. We get thereby HS = S by §1.3 (i), 
so that b cannot be omitted from every complex. 

THEOREM 4.3. For any p in S, (Ha)p is some complex Hb. 

For we saw in Theorem 4.2 that p lies in some complex Hm. Then by 
§1.3 (iv), (Ha)p = (Ha)(Hm) = Hb. 
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TurorEM 4.4. Let a, b denote any pair of elements of S. Then either 
Ha A Hb = 0 or Ha = Hb. 

Proor. Assume Ha ¥ Hb so that two cases are possible: (i) Ha A Hb = 0, 
and the theorem is proved; (ii) Ha A Hb = T + 0, T proper in Ha and Hb. 
In this case we have (Ha)T < (Ha)(Ha) (i.e., S He for some c). Now by 
$1.3 (i) the order of (Ha)T is not less than the order of Ha but from the previous 
step the order of (Ha)T is not greater than the order of He. Since Ha and He 
are of the same order we have (Ha)7 = (Ha)(Ha); similarly (Ha)T = 
(Ha)(Hb), and thus by Theorem 4.1, Ha = Hb. 

TuroreM 4.5. The elements of H itself all belong to some single complex Ha. 

Proor. If the theorem were not true, then for some h and h’ in H we would 
haveh < Ha, h’ < Hb, Ha A Hb = Ofor somea, b. Let now k be any element 
of S. Then hk < (Ha)k, h’k < (Hb)k with (Ha)k A (Hb)k = 0 by §1.3 (ii). 
Now by Theorem 4.3, (Ha)k and (Hb)k are complexes of the form Hp, Hq. 
We have then shown that hk < Hp and h’k < Hq where Hp A Hq = 0. When 
we consider that both hk and h’k lie in Hk we reach a contradiction of Theo- 
rem 4.4. 

In regard to the relation existing between the set of complexes Hb and the 
elements b which determine them we have 

THEoREM 4.6. If Ha = Hb, then a and b both lie in the same complex He, 
and conversely. 

Proor. Assume a < Heandb < Hd. Then Ha = H(Hc) and Hb = H(Hd) 
(Theorem 4.5); hence H(Hc) = H(Hd) so that He = Hd by Theorem 4.1. 
Conversely, if a < He and b < He, then Ha = H(Hc) = Hb. 


It is now clear, in view of Theorems 4.4-4.6, that elements a; , a2, --- , @, 
exist in S such that 
(6) S = Ha, + Ha. + --- + Ha, (Ha; A Ha; = 0,7 ¥ 9), 


in which we may without loss of generality take Ha; = H. Also observe that 
the decomposition (6) is not altered if any a; be replaced by any other element 
of S which lies in the same complex Hz with it (Theorem 4.6). 

THEoreM 4.7. The set of complexes Ha; of (6) forms a quasi-group with 
product of complexes as the ‘‘product’’ of §1.1 (i). 

For the product of two complexes is a complex since H is left invariant; thus 
§1.1 (i) is satisfied. The result of Theorem 4.1 satisfies §1.1 (ii). 

Derinition. The group whose existence is proved in Theorem 4.7 is called 
the quotient quasi-group of S by H and is denoted by S/H (tentatively by 
HS until after Theorem 4.9). 

THEoreM 4.8. Let H be an invariant complex. Then for any a in S, Ha is 
also an invariant complex. 

Proor. Consider the product ((Ha)b)((Ha)c) which by Theorem 4.3 takes 
the form (Hp)(Hg) = Hr. Now in H™'S we solve the equation (cf. Theorem 
1.1) (Ha)X = Hr for the complex X. Let d be an element of X. Then Hr = 
(Ha)d. We then see from the discussion above that ((Ha)b)((Ha)c) = (Ha)d 
so that Ha is invariant. 
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We are now in a position to show that a left invariant complex is also right 
invariant and thus to justify the parenthetical remark in our definition of an 
invariant complex. 

TuroreM 4.9. If b is any element of S, the complex bH is also of the form Hp 
for some p in S, and conversely. Thus a left invariant complex is right invariant 
and conversely. 

For by Theorem 4.2, b lies in some complex Hz; then by §1.3 (iv), bH = 
(Hzx)H, so that by Theorems 4.5 and 4.7,bH = Hp where Hp = (Hz)H in H''S. 

Conversely, Hp is of the form bH since in H™'S we can solve the equation 
Hp = XH for the complex X by Theorem 1.1. Let 6 be any element in X; 
then Hp = bH by §1.3 (iv). 

It is now clear that (b6H)(cH) = (Hp)(Hq) = Hr = dH which proves the 
theorem. 

Obvious analogues of Theorems 4.1-4.6 show that a decomposition of S of 
the form 


analogous to (6) is possible. Clearly, the b:;H of (7) are the same as the Ha; 
of (6) in some order in view of Theorems 4.9 and 4.4. 

We conclude this section with three theorems on quotient quasi-groups. 

TuHEorEM 4.10. Let K be a sub-quasi-group of a quasi-group S, and let H 
be an invariant complex contained in K. Then H is invariant in K, and K/H 
is a sub-quasi-group of S/H. 

Proor. Since H < K, the complexes Hk, , Hke consist of elements of K; 
thus their product (Hk,)(Hke) consists of elements of K. Since now H is 
invariant in S, there exists an element c in S such that (Hk;)(Hkz) = He. We 
shall prove that c lies in K. Suppose this were not so. Then K + c isa 
complex of greater order than K. Then H(K + c) is of greater order than K 
by §1.3 (i). This contradicts the fact that H(K +c) <s HK + He & K since 
HK #S K and He Ss K from the preceding discussion. Thus # is invariant 
in K, K/H exists and forms a sub-quasi-group of S/H by §1.4 (i). 

THEOREM 4.11. Let Q be a sub-quasi-group of S/H. Then the set Q’ of ele- 
ments of S belonging to the complexes which lie in Q forms a sub-quasi-group of 8. 

Proor. Let Q consist of complexes Hq , Hq: , --- , Hq: where , % 
are a subset of the a; in (6). Then Q’ is the complex Hq + --- + Hq in S. 
Let a’, b’ be elements of Q’. Then for some i, j we have a’ < Hq;, b’ < Hqi 
and a’b! < (Hqi)(Hq;) = Ha since QQ = Q. Thus a’b’ < Q’, and Q’ isa 
sub-quasi-group by §1.4 (i). 

TuroreM 4.12. Let Q be an invariant complex in S/H. Then the set Q' of 
elements of S belonging to the complexes which are contained in Q forms an in- 
variant complex in S. 

Proor. If Q be invariant in S/H, then a complex Ha, exists in S/H such 
that (Q(Ha,))(Q(Ha;)) = Q(Ha). If in S, a’ < Ha;, then 
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= (Hm + --- + Hq,)a’ 
= (Hq)a’ + --- + (Hq)a’ 
= (Hq)(Ha;) + --- + (Hq,)(Ha,) by §1.3 (iv) 
= Q(Ha;,). 


If also b’ < Ha; and c’ < Ha, , then by repeating the argument just made we 
have (Q’a’)(Q’b’) = Q’c’. Thus Q’ is invariant in S. 


4.2. Homomorphism of a quasi-group. The quotient quasi-group defined in 
§4.1 is readily seen to be an example of a system S satisfying the following 

Derinition. Let S denote a set of elements a; , a2, - - - satisfying the closure 
property (§1.11) of a quasi-group with respect to some operation. Let S be 
any quasi-group such that there exists a correspondence a — a from S to § 
with the two properties: 

i) to each a in S there corresponds one and only one @ in S; 

ii) no @ in S is without a correspondent in S. 
We call the set S a homomorph of S when the correspondence satisfies 

iii) if a > a, b > 6, then ab — ab. 
If the correspondence is one-to-one S is called an isomorph of S and S and S 
are said to be tsomorphic. 

In the discussion of homomorphism of a quasi-group we shall need the fol- 
lowing 

Lemma 6. Let Hy, --- , bea set of complexes in S such that 


and such that for all i, 7 we have 
(9) HH; 


for some k. Then the complexes H, are all of the same order, and H;H; = Hy. 

Proor. We assume the lemma false. Then without loss of generality we 
may take H,, --- , H, to be of maximal order d, and H,4;, --- , H, to be of 
order less than d. Consider now the set of complexes HiH;, i = 1, ---,7. 
By §1.3 (i) their orders are not less than d, and thus by (9), since d is maximal, 
their orders are d. They are distinct since HiH; = H,H; implies H; = H; 
by §1.3 (v). Thus they are Hi, --- , H,in some order. Consider now H,H,4: , 
which by §1.3 (i) is also of order d. This means, in view of (9), that 


(10) = 


for some j from 1, --. ,r. Now let h be an element of H;. Then by §1.3 (iv) 
wehave hH; = H,H;if1 <j <r. Thus from (10) hH; which implies 
that hH,,, A hH; # 0. This is seen to be a contradiction of (8) in view of 
§1.3 (ii). Thus the lemma is true. 
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TurorEM 4.13. A homomorph S of a quasi-group S is a quasi-group. 

Proor. Let H; denote the complex composed of those elements of S which 
correspond to d; in S. Then by (i) Hi A H; = Oif t ¥ j. Now if in §, 
aa; = &, then HH; S H;, since every element a,a; of H;H; is mapped on 
a, by (iii). By the lemma just proved we see that the H; are all of the same 
order and that HjH; = H,. We now show that 8 is a quasi-group. First, 
it satisfies §1.1 (i) by definition. Second, it satisfies §1.1 (ii) also since if 
Ga; = dd, in S for j ¥ k then in S we would have HiH; = H,H, for H; A 
H;, = 0, which contradicts §1.3 (v). 

We have already remarked that the quotient quasi-group S/H is a homo- 
morph of S. The correspondence is b — Ha; if b < Ha; and “product” 
in S/H is the complex product (Ha,;)(Ha;). We now show that the converse 
is true. 

Tureorem 4.14. Corresponding to any homomorph 8 of a quasi-group 8 
there exists in S an invariant complex H such that S/H and S are isomorphic. 

Proor. We saw in Theorem 4.13 that the elements of S fall into disjoint 
classes H; of equal orders such that the correspondence H; <> 4; is one-to-one 
and the H; form a quasi-group isomorphic with S. Thus Hi, HiH2, ---, 
HH, are all distinct (§1.1 ii) and equal to H;, , Hi, , --- , Hi, where the sub- 
scripts 71, 72, ---,% are a permutation of 1, 2,---,¢. Now the relation 
H;,; = H,H; = Ha; (where a; is any element of H;) shows that the H; are all 
of the form Hia;. Then obviously (Hia)(Hib) = Hic for some c and HA, is 
invariant by §4.1. 


4.3. Invariant complexes with special properties. We have mentioned the 
existence (§4.1) of invariant compléxes H such that 


(11) (Hx)(Hy) = H(cy) 


for all z, y. We now ask: What is special about the homomorph S which cor- 
responds to an invariant complex H satisfying (11)? The answer is found in 
the following 

THEorEM 4.15. A necessary and sufficient condition that there exist an in- 
variant complex H in S satisfying (11) is that there exist a homomorph S of 8 
containing an element a such that 
(12) da;)(aa;) = 
for all i, j and such thata > difa < H. 

Proor. The condition is necessary since S/H is a homomorph for 
which, if x < Ha, y < Hb, the relation (Hz)(Hy) = H(ay) in S implies 
(H(Ha))(H(Hb)) = H((Ha)(Hb)) in S/H. Thus in S(= S/H) we have 
a(= H) with the property required by the theorem. 

The condition is sufficient for if H, H; , H; denote the classes of elements in S 
which correspond to 4d, a; , 4; , respectively then (Theorem 4.13) 


(13) (HH;)(HH;) = H(HiH;) 
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for all i, 7. Let now 2, y be any elements of S. If x < Hi, y < H;, then 
ry < HH; so that by §1.3 (iv) we have (Hz)(Hy) = H(zy). 

CorotLtary 1. A necessary condition for (12) is that a be idempotent in 8, 
that is, that @ = a. 

For (12) implies (aa@)(4@) = a(aa) so that (§1.1 ii) @ = a. 

CoroLtaRy 2. A sufficient condition for (12) is that a be a left identity in 8. 

For if da; = 4; for all 7, then (G@;)(@a;) = Ga; = 

As an immediate consequence of Corollary 1, we have 

TurorEeM 4.16. If an invariant complex H satisfies the relation (Hx)(Hy) = 
H(ay) for all x, y in S, then H is a sub-quasi-group, t.e., HH = H. 

And from Corollary 2 follows 

TurorEeM 4.17. An invariant complex H such that 


(14) H(Ha) = Ha 


for all a in S also satisfies the relation (11). 

We shall require the following theorem later. 

TuroreM 4.18. If H be an invariant complex such that H(Ha) = Ha for 
all a, then a < Ha and conversely; and either of these conditions implies (Ha)b = 
H(ab) for all a, b in S. 

Proor. Let H(Ha) = Ha. Supposea < Hb; then by §1.3 (iv), Ha = H(Hb) 
so that by Theorem 4.1, Hb = Ha. Conversely, if a < Ha then Ha = H(Ha). 
Finally, if b < Hb then (Ha)(Hb) = (Ha)b = H(ab) by the first part of this 
theorem and by Theorem 4.17. 

Corottary. H(Kb) = (HK)b for any complex K. 

We now consider briefly the relation 


(15) (cH)(yH) = (xy)H. 


Theorems analogous to Theorems 4.15-4.18 may be proved for this case if the 
order of all products Hx used in them be reversed. The following theorem 
relates the two types (11) and (15) of invariant complexes. 

THEorEM 4.19. If H be invariant in S and if 


H(Ha) = Ha = (Ha)H 


for all a, then H is normal in S (§3.2) and thus satisfies both (11) and (15). 

For we have a < Ha by Theorem 4.18. Then (Ha)H = aH by §1.3 (iv), 
so that by our hypothesis aH = Ha for all ain S. Thus H is normal in S. 
The rest of the theorem is obvious. 


4.4. Relations between invariant complezes. 

THEorem 4.20. Let K be an invariant complex such that K > H. If H 
satisfies (11) so does K or if H satisfies (14) so does K. 

Proor. Since K is invariant (Ka)(Kb) = Ke for some c. Now Ha < Ka 
and Hb < Kb. Thus H(ab) = (Ha)(Hb) < Ke. We have also H(ab) < K(ab). 
Thus Ke A K(ab) # 0. Therefore by Theorem 4.4, Ke = K(ab). This 
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proves the first part of the theorem. The latter part follows from the fact that 
ifa < HaandH < K, thena < Ha < Ka. 

TuroreM 4.21. If H and K both satisfy (11) then their intersection D, if not 
vacuous, also satisfies (11). And if H and K satisfy (14) then D is not vacuous 
and satisfies (14). 

Proor. Since (Da)(Db) < (Ha)(Hb) we have (Da)(Db) < H(ab). Simi- 
larly (Da)(Db) < K(ab). But since D = H A K we have from §1.3 (ii) 
D(ab) = H(ab) A K(ab). Thus (Da)(Db) S D(ab) with the inclusion sign 
impossible by §1.3 (i). Hence (Da)(Db) = D(ab). 

If a < Ha,a < Ka, thena < (H A K)a = Da. NowD = (H A K) =0 
is impossible since then Ha A Ka = 0 which contradicts the fact that both 
contain a, 

THEoreM 4.22. If H satisfies (14) and K satisfies (11), then HK = M satisfies 
(11) and K < M. Also if K satisfies (14) so does M. 

Proor. We have (Ma)(Mb) = ((HK)a)((HK)b) = (H(Ka))(H(Kb)) by 
the corollary to Theorem 4.18. Now (H(Ka))(H(Kb)) = H((Ka)(Kb)) since 
H satisfies (11) also. But 


H((Ka)(Kb)) = H(K(ab)) = (HK)(ab) = M(ab). 


Thus M satisfies (11). Nowsincek < Hkforallk < K wehave K < HK = M. 
Finally, if K satisfies:(14), a < Ka for anya. Thus from the preceding steps 
we have a < H(Ka) = Ma. 


4.5. We now give a brief discussion of maximal invariant complexes and 
simple quasi-groups. 

Derinition. An invariant complex is called maximal in S if it is contained 
in no other invariant complex except S itself. 

DEFINITION. A quasi-group is called simple if it contains no invariant com- 
plexes (that is, none consisting of more than one element of S). 

THEOREM 4.23. If H be a maximal invariant complex then S/H is simple. 

Proor. Let S/H contain an invariant complex. If this complex does not 
contain H, then one of its co-sets does. But by Theorem 4.8 this co-set is an 
invariant complex. Hence H is not maximal in S by Theorem 4.12. This 
contradicts our hypothesis. 

TuroreM 4.24. If H and K are permutable (that is, HK = KH), if both 
satisfy (14), and if both are maximal, then if we denote HK by M and H \ K 
by D we have M = S and S/H isomorphic with K/D, S/K isomorphic with H/D. 

Proor. By Theorem 4.22, M is invariant and contains H and K since they 
are permutable. Now this means that M = S since H and K are maximal. 
By Theorem 4.21, D # 0 and satisfies (11). Since D < K and (Dk;)(Dks) = 
D(kikz) we see that D is invariant in K (that K is a quasi-group follows from 
Theorem 4.16). Thus as in (6) we have 


(16) K = Dk, + + + Dky. 
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Then 
(17) HK = H(Dk,) + H(Dkz) + + H(Dk;) 
= (HD)k, + (HD)kz + + (HD)k, 
by the corollary to Theorem 4.18. Now HD = H since HD s H (for D < H) 


with the inclusion sign impossible by §1.3 (i). Now from (17), since we have 
already proved HK = S, we have 


(18) S = HK = Hk, + Hk, + .-. + Hky. 


We compare (18) with (16) and find that the correspondence Hk; < Dk; is an 
isomorphism between S/H and K/D since both H and D satisfy (11). 
The other part of the theorem follows similarly. 
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ANNALS OF MATHEMATICS 
Vol. 41, No. 3, July, 1940 


UBER EINE ALLGEMEINE THEORIE DER FUCHSSCHEN GRUPPEN 
UND THETA-REIHEN 


Von Masao SuGAWARA 


(Received June 6, 1939) 


Die Theorie der elliptischen Modulfunktionen ist kiirzlich von Herrn C. L. 
Siegel auf den Fall verallgemeinert worden, der sich auf allgemeinen Typus der 
Riemannschen Flichen mit dem Geschlecht n bezieht.’ Entsprechend méchte 
ich hier die Theorie der Poincaréschen automorphen Funktionen’ auf “héher- 
dimensionale”’ Fille iibertragen. Wir fiihren den Begriff der allgemeinen Fuchs- 
schen Gruppen ein, und beweisen dafiir den Diskontinuttdtssatz, auf Grund 
dessen die Konvergenz der fiir diese Gruppen konstruierten Theta-Reihen gesi- 
chert sein wird. Die Siegelsche Modulgruppe steht in enger Beziehung mit 
einer gewissen Fuchsschen Gruppe. Insbesondere folgt die Existenz des Funda- 
mentalbereiches fiir die Siegelsche Gruppe direkt aus unserem Diskontinui- 
tiitssatz.° 


I. Bezeichnungen und Vorbemerkungen 


Es handelt sich im folgenden hauptsichlich um n-dimensionale quadratische 
Matrizen mit komplexen Elementen. Mit grossen lateinischen Buchstaben 
bezeichnen wir meistenfalls solche Matrizen. Schreiben wir etwa 


U; U: F 
U= bzw. X= 


so bedeuten dabei U; , U2, Us , Us, P, Q immer n-dimensionale Matrizen; U ist 
die daraus gebildete 2n-dimensionale Matrix, und X eine Matrix vom Typus 
(2n, n). Mit EF bezeichnen wir die (n-dimensionale) Einheitsmatrix. Die 
transponierte bzw. konjugiert-komplexe Matrix von einer Matrix A sei mit A’ 
baw. A bezeichnet. 

Ist H Hermitesch, und ist die zugehérigen Hermitesche Form positiv definit, 
so schreiben wir H > 0. Sind H, , Hz. Hermitesch und H, — Hz > 0, schreiben 
wir auch H; > Hz. Da offenbar mit H; > 0, H2 > 0 auch H,; + Hz > 0 gilt, 
kann man mit diesen ‘‘Ungleichungen” wie iiblich rechnen. Ist A reguliir, d. h. 


1C. L. Siegel: Analytische Theorie der quadratischen Formen. Ann. of Math. 36, 
Insb. §13. 

*H. Poincaré: Acta Math. I, II, III. Q&uvre, t. II. Vgl. auch. H. Weyl: Die Idee der 
Riemannschen Fldache, insb. §§20-21. 

* Bei der Redaktion dieser Arbeit ist mir Herr S. Iyanaga behilflich gewesen. Ich. 
méchte ihm auch an dieser Stelle meinen besten Dank aussprechen. 
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ist die Determinante | A | # 0, so gilt stets A'A > 0. Ist umgekehrt H > 0, 
so lisst sich H in der Form H = A’A mit einem reguliren A darstellen. 

Ein Paar von Matrizen P, Q heisst nach Siegel* symmetrisch, falls P’Q = Q’P 
gilt. Ist dabei Q reguliir, so ist die Matrix PQ” symmetrisch. Umgekehrt 
lisst sich jede symmetrische Matrix Z in der Form. Z = PQ", |Q| # 0 
schreiben, wobei P, Q ein symmetrisches Matrizenpaar bilden: es geniigt ja 
etwa P = Z,Q = E wu setzen. 


II. Raum, Grundgebiet und Bewegungen 


Die Gesamtheit aller symmetrischen Matrizen Z bildet unsern Raum %. 
Jedes Z heisst ein Punkt von ®. Wir wollen uns dabei ® als im n (n + 1)- 
dimensionalen Euklidischen Raum eingebettet denken: ® ist also metrisch und 
als Entfernung p(Z, Z°) zweier Punkte Z = (2%) = (ax + iy) und Z° = (z%,) = 
(rt + gilt die Grosse — ah)” + (yx — yix)*)|’. Die Menge 
% der Punkte Z = (zx), so dass E > Z’Z gilt, nennen wir das Grundgebiet. 
% ist ersichtlich eine offene Menge in 8. % ist ferner beschrinkt. 

Aus E > Z’Z folgt naimlich, dass die Elemente auf der Hauptdiagonale von 
E — ZZ positiv sein miissen, d. h. | |? < 1,7 = 1,2,---,n. 


Nun setzen wir 
0 0 
5): 


Die 2n-dimensionalen (reguliren) Matrizen U, die den Bedingungen 
(1) U'JU = J, (2) U'SU =S8 
geniigen, bilden eine Gruppe T. Sei U ein Element von T. Setzen wir U = 


J 
und betrachten wir die Substitution 


(3) = (UiZ + Us)(UsZ + 


Satz 1. Die Substitution (8) ist fiir ale Z ¢ A sinnvoll und bildet A in sich ab. 

Demnach nennen wir die Substitution (3) eine Bewegung von A, die von 
U el induziert wird. 

Zum Beweis schreiben wir Z = PQ™' mit ein symmetrisches Paar bildenden 


Matrizen P, Q, | Q| ¥ 0 und setzen X = (6): X, = UX, Xi = (op Nach 


(1), gilt 
XiJX, = X'U'JUX = X'JX, 


‘Siegel a. a. O. §12. 
*Ubrigens ist % auch konvex. Aus E > Z,Z,, und E> Z,Z;, folgt naimlich E > 


Zi+ Ze 
wegen 


(£+8) = 2(E — + UE — + — — 
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oder — = P'Q QP =0. 
D. h. P; , Q: bilden ein symmetrisches Matrizenpaar. Aus (2) folgt ahnlicher- 
weise 


X,SX, = X’U'SUX = X'SX, 
oder PiP, — = P’P — Q’0. 


Da nun E > ZZ mit Q’Q > P’P gleichbedeutend ist, folgt QiQ, > PiP,. Ist 
andererseits Qig = 0 mit n-dimensionalem Vektor ¢, so auch g’Q; = 0 und (*) 
liefert durch linksseitige Multiplikation mit g’ und rechtsseitige Multiplikation 
mit @ die Gleichung 
(Piy)'(Pig) = — Q'Q)e; 
i die linke Seite dieser Gleichung ist = 0, wihrend die rechte Seite < 0 ist; also 
| (P’P — Q'Q)e@ = 0. Mithin = 0, weil P’P — Q’Q < 0, daher ist | Q, | 0. 
Die Substitution (3) ist also sinnvoll fir Ze % und Z, = P,Qi’. Hieraus 
folgt dass mit Z % auch Z; ¢ gilt. 
Satz 2. Jedes U eT hat folgende Gestalt. 


_ (Ui 
wobei gill: 
(5) = U3U, d. h. und Us bilden ein symmetrisches Matrizenpaar; und 
(6) Ui0, — = E. 


Umgekehrt gehért jede solche Matrix der Gruppe T an. 
Brewels. Eliminiert man U’ aus (1), (2), so bekommt man —J = 
oder U = —SJUJS, oder auch, ausgeschrieben 


Us Us 

Us U, Uz 

Daraus folgt, dass U die Form (4) hat. Setzt man das in (1) oder in (2) ein 
und rechnet man aus, erhalt man die Bedingungen (5), (6). Umgekehrt 
bestitigt man leicht, dass eine Matrix U von der Form (4) mit den (5), (6) 


gentigenden U, , die Bedingungen (1), (2) erfiillt. 
Die Bewegung (3) schreibt sich hiernach in der Form 


(3’) Zi = (UiZ + + 01)", 


wo U;, Us (5), (6) geniigen. Soll insbesondere eine Bewegung den Nullpunkt 
(d. h. die Nullmatrix) O festlassen, so muss in der Darstellung (3’) UiU: = 
a (d. h. U; unitar), und Uz = O sein. Setzt man nimlich Z = Z, = 0 in der aus 
4 (3’) folgenden Formel Z,(U2Z + U;) = UiZ + U2, erhalt man U2 = 9; 
= E folgt dann aus (6). Umgekehrt eine soleche Substitution ersichtlich 
gentigt die Bedingung. 


aa 
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Lisst eine Bewegung sogar alle Punkte von YI fest, d. h. ist sie die identische 
Bewegung von %{, so muss ferner (’; cine diagonale Matrix mit den Diagonal- 
elementen £1 sein. Dann gilt niimlich Z = U,\ZUy' oder ZU, = U,Z fir alle 
Ze%. Also ist U; reell und mit allen Z ¢ Y% vertauschbar, woraus leicht die 
Behauptung folgt.—Die aus solechen U; und U2 = 0 gebildeten U bilden einen 
Normalteiler Ty mit der Ordnung 2” von T. Die Gruppe & aller Bewegungen 
ist >T/To. 

Sei nun A ein Punkt von %. Man kann eine Bewegung, die A in O iiber- 
filhrt, folgendermassen konstruieren: Man setze E — A’A = C’'C, (C’)' = C, 


und 
0 G/\-A E -GA J’ 
Dieses Us gehért zu: es hat ja die Form (4) und die Bedingungen (5), (6) 


sind fir = C,, Uz = erfiillt, wie man rechnerisch verifizieren kann.° 
Die entsprechende Bewegung 


LZ = Ci(Z A)(E 


fiihrt offenbar A in O iiber. Hieraus folgt der 

Satz 3. Die allgemeine Bewegung von A, die einen Punkt A € A in einen 
Punkt B Xiiberfithrt, wird durch das Element UZ" Us (U 1U, = 8), 
induziert. Sie wird also durch folgende Formel gegeben: 
(7) Dj(Z, — B)(E — BZ,)"D;' = UiC\(Z — A)(E — 


wobet C; die friihere Bedeutung, und D, die entsprechende Bedeutung fiir B hat. 
Liisst man hier U, alle unitére Matrizen durchlaufen, bekommt man alle solche 
Bewegungen.’ 

Die Bewegung (7) wollen wir kurz mit (A, B, U;) bezeichnen. 


III. Allgemeine Fuchssche Gruppe 
Die Bewegungsgruppe % ist eine topologische Gruppe: die Bewegungen 
Uf?) und (Ae, Be, liegen nahe bei einander, wenn sich die 
Matrizen A,, B,, U{” baw. von Az, Be, U{? nur um wenig unterscheiden. 
Kine Untergruppe G von B nennen wir eine allgemeine Fuchssche Gruppe, wenn 
sie keine infinitesimale Bewegung enthilt, d. h. wenn das Einselement (mithin 
auch jedes Element) in der Gruppenmannigfaltigkeit isoliert liegt. 


° (5) besagt: C}C,A = AC{C,, oder AC’C = C’CA. Das folgt aus A(E — AA) = 
(E — AA)A, indem man A’ = A beachtet. (6) besagt: C;@: — AC;CiA = E oder 
(C’'C) — A(C’)1A = E. Das folgt aus E — AA = E — A(E — AA)“ A(E — AA). 

"Setzt man Z = A + dA in (7), und lasst man dA nach O riicken, findet man, dass ds? = 
Sp dA(E — AA)~ dA’(E — AA)- bewegungsinvariant ist (indem man die Glieder mit 
(dA)? vernachlissigt). So kann man % als einen Riemannschen Raum mit diesem ds? 
auffassen. In dieser Note machen wir aber davon noch keinen Gebrauch. Vgl. R. Nevan- 
linna: Kindeutige analytische Funktionen §1. 
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Satz 4. (Diskontinuitdtssalz.) Jede allgemeine Fuchssche Gruppe ist eigent- 
lich diskontinuierlich in A. Ist néimlich A ein beliebiger Punkt von A, und sind o; 
die Elemente von einer allgemeinen Fuchsschen Gruppe ©, so weist die Punktmenge 
{o:A} keinen Héufungspunkt in % auf. 

Brewers.» Angenommen, die Punktmenge {¢;A} besitze eine Haufungsstelle 
in A. Setzen wir einfachheitshalber o;A = A;. Fir ein beliebiges « gibt es 
dann ein gewisses 7, sodass 
(8) p(A;, Ar) < 
fir unendlich viele Werte von k gilt. Die Bewegung o,0;' ¢ G fiihrt den Punkt 
A; in A, tiber; wir konnen also = (Ai, Ar, U {”) mit einem gewissen 
unitiren U{® schreiben. Da aber die Gruppe aller n-dimensionalen unitiiren 
Matrizen offenbar kompakt ist, lasst sich eine Teilfolge von U{" so wihlen, dass 
sie konvergiert. Wahlt man also ein geeignetes Paar 1, m der Werte von k aus 
dieser Teilfolge, so unterscheiden sich die Matrizen Uj? und U{” beliebig wenig 
von einander. Wegen (8) liegen aber auch A;, A» beliebig nahe bei einander. 
Die Bewegung 


10m = (010% = (Ai, Ar, Ut”)(Ai, Am, 
wire dann also infinitesimal, was unserer Voraussetzung widerspricht. 


IV. Theta-Reihen 


Es sei & eine allgemeine Fuchssche Gruppe. Ein Element o ¢ @ sei von der 
Form: 


o(z) = (UiZ + Ue2)(U3Z + 
Die Theta-Reihen fiir G definieren wir dann durch 
@,(Z) = |UsZ + (k = 2). 


Satz 5. Die Rethe O,(Z) ist absolut und gleichméssig konvergent in der Umge- 
bung von Ze Sie stellt also eine in reguldre analytische Funktion von zix 
dar, und es gilt: 


O,(o(z)) = | + Us |*"*O,(Z). 


Bewets. Es geniigt offenbar die Konvergenz der Reihe zu zeigen. Es sei 
Z° = (zn) und B = (Z;| zn — | <r)? 

r > 0 sei dabei so klein gewihlt, dass alle Gebiete cB, o « G, zu einander 
fremd liegen. v(%) sei das Euklidische Mass von %, welches offenbar positiv ist. 

Wegen der Beschranktheit von ist dann v(¢B) konvergent. Anderer- 


seits gilt v(oB) = [ IdZ, wol = a) den absoluten Betrag der Funk- 


® Vgl. E. Cartan: Lecons sur la géométrie projective complexe, §78. Den Hinweis auf 
diesem Buch verdanke ich Herrn Iyanaga. 
B(Z; | — | <r) bedeutet die Menge derjenigen Z, fiir welche | zi — zik | <7 
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tionaldeterminate fir die Bewegung o bedeutet. Man setze nun P; = 
UZ + Us, = Us% + Ussodass Z, = P:Qr', Pi = also dP; = dZ,Q, + 
7,dQ,. Daher folgt Qi1dZQ: = QidP: — = QidP, — PidQ = 
QiU,dZ — P{U;dZ = dZ, letzteres wegen (4), (5), (6) Satz 3. Man hat also 
dZ, = Qi 'dZQz". Daraus findet man wie Siegel als Werte von J, J = 

Hieraus folgt 


d 
|| UsZo + (ar?) 


also folgt die Konvergenz der Reihe @2(Z). Daraus folgert man leicht die 
absolute und gleichmiissige Konvergenz von @,(Z) fiir | z% — zi. | < 4r. Man 
setze nun S = || UsZ + Us Dann gilt || + Us|"? < st, 
mithin || UsZ + Us|[*@*? < || + far k > 2. Also 
konvergiert @,(Z) auch fiir k > 2 absolut und gleichmissig. 


V. Modulgruppe 


Die von Siegel eingefiihrte Modulgruppe 2 ist die Gruppe aller reguliren 
2n-dimensionalen Matrizen mit ganz-rationalen Elementen, die der Bedingung 
setzt man X = (x%), Y = (yx), sodass Z = X + 7Y, dann fiihrt die Substi- 
tution Z; = (AZ + B)(CZ + D)™ ein Z mit Y > 0 in ein ebensolches Z; = 
X, + 7Y; mit Y; > Oiiber. Die Gesamtheit © aller Z = X + 7Y mit Y > 0 
nennen wir die Szegelsche Halbebene. § bleibt also gegen Siegelschen Modul- 
substitutionen invariant. | 


Mittels der transformation 
W = (Z —iE\(Z +7 


wird nun § auf unser Grundgebiet % homéomorph abgebildet, wie folgende 
leicht verifizierbare Identitat zeigt: 


W'W — = -2(2 + + iz)". 
1 (E -# 
ik 


v(oB) = 


Setzt man entsprechend 7 = ,U = TMT" fir M eM, so 


_™ || A || bedeutet den absoluten Betrag der Determinante von A. C. L. Siegel: Uber 
die Zetafunktionen indefiniter quadratischer Formen II. Math. Zeit. 44, 8. 403. 
"Man erhilt dies nach wiederholter Anwendung der Formel fiir eine analytischen 


r 
Funktion f einer Variablen, f(¢)r? = [ i J(s + re*) dr dé, die sich aus der Integral- 
0 40 
darstellung von Cauchy ergibt. 
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folgt U eY. Da ferner M offenbar keine infinitesimale Transformation ent- 
halt, entspricht der Gruppe der Siegelschen Substitutionen eine allgemeine 
Fuchsseche Gruppe. Nach unserem Satz 4 besitzt sie also einen Fundamental- 
bereich in der Siegelschen Halbebene. 


Mecuroxu Komasa Daticut Koro-Gaxkko, 
Toxyo, JAPAN. 


Aus T’JT = J folgt U'JU = M'T'JTMT™ = TY’M'IMT™ =J. Aus T'ST = iJ 
folgt U'SU = = iT-YM'TMT™ = S. 
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FINITELY ADDITIVE INTEGRAL 
By S. BocHNER 


(Received November 16, 1939) 


H. Freudenthal’ has shown that under proper assumptions a partially ordered 
vector space C can be realized by measurable (countably additive set) functions 
ona properly construed Boolean algebra with a Lebesgue measure. His assump- 
tions are that any countable set in C which is bounded from above (below) has 
a l.u.b. (g.l.b.); that there is another limit in the space whose topological impli- 
cations are practically those of a Banach norm; and that there exists a positive 
element F = 1 no part of which is 0; precisely, if g > 0 then inf (g, 1) > 02 

(Generalizing a previous construction’ we will establish an analogous result 
for the finitely additive case. We will completely eliminate the assumption 
that infinite sets of element which are bounded in the sense of order have an 
inf or sup respectively. Instead of the limit topology we will more specifically 
assume the existence of a functional with the properties of an integral, but it 
will be only finitely additive. However our assumption concerning the element 
f = 1 will be rather restrictive; we will assume that every other element g is 
bounded from above and below by a numerical multiple of 1. 

Our construction is given in section IV after a preparatory step in section III. 
Sections I and II contain definitions and obvious analogues to other results 
given in “A.” Furthermore we have appended to each theorem a generaliza- 
tion bearing on the case in which the integral or measure involved is not a 
number but an element of another vector space. It is worth pointing out 
that the conditions which have to be imposed on that vector space differ seri- 
ously from case to case. 


I. Definitions 


A partially ordered space C has the following properties: 

1) C is an Abelian group with respect to addition and with real numbers as 
coefficients (operators). 

2) C is partially ordered (relation >) and (i) corresponding to any elements 
f, 9 there exists an element sup (f, g) which is the smallest element = f, = g, and a 
similar element inf (f, g), (ii) f > 0 impliesO0 > —f, f > gimpliesf +h > g + h, 
(iii) f > 0, a > 0 (ais a real number) implies af > 0. 


‘H. Freudenthal, Teilweise geordnete Moduln, Proceedings Royal Academy Amsterdam, 
36(1936), 641-651. 

* We are using the notation of L. V. Kantorovitch, Lineare halbgeordnete Ratime, Recueil 
Mathématique, Moscow, 2(1937), 121-168. 

*S. Bochner, Additive set functions on groups, Annals of Mathematics, 40(1939), 769-799, 
esp. 774-775. This paper which will be quoted frequently will be referred to as ‘‘A’’. 
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3) There exists a positive element 1 (its multiple with a number a will be also 
denoted by a) such that corresponding to any f €C there exist numbers p, q for which 


(1) psfsq. 


A field [|C, M] has the further property: 

4) Mf is a number which is defined for all f ¢C such that (i) M(af + bg) = 
aMf + bMg, (ii) Mf > 0 for f > 0, (iti) M1 = 1. 

A complete field [C, M] has the additional property 

5) If fn, gn €C such that 


(3) lim’ My, = lim Mg. 


then there exists one and only one element h, such that 
(4) f Jn 


The element h is unique. If h and h; both satisfy (4) then so will their sup 
and inf. But 


(m,n = 1,2,---). 


fm 2 sup (h, hi) = inf (Ah, hi) = gn 


implies 
M (sup (h, ki) — inf (A, hi)) S Mfn — Mg, 
and therefore, by (3) and 4), (ii), 
sup (h, hi) = inf (h, hi). 


Hence h = h,. 

A Boolean algebra B of elements {£} has the property 

6) B is a partially ordered set with a smallest element 0 and a largest element 
E = B, and two operations of “addition” and “multiplication” which are com- 
mutative and associative each and distributive with each other and an operation 
of complementation such that (i) E, < E> if and only if E, = E,E2, (ii) E+E = 
EE = E, (iii) = = (iv) corresponding to elements 
E, S E; there exists an element E3(= — E;) for which = 0, + Es = 

A (Jordan) field |B, vE] has the further property 

7) 00 = 0, > E > 0, vB = 1, + Es) = vE, + if = 0. 
The measure vE will also be denoted by | E |. 

A complete field |B, vE] has the additional property 

8) If , Ex B such that 


(6) lim | = lim | | 
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then there exists one (and only one) element EZ such that 
(7) E, = E 2 Ey. 


II. Additive set functions on Jordan fields 


The definitions and theorems given in “A,” p. 773-783 can be made available 
for set functions on a general Jordan field [B, vE] by two different procedures. 
We can either use the theorem of Stone‘ that any Boolean algebra can be repre- 
sented by subsets of a point set; or we can avoid the transfinite induction 
involved in Stone’s proof by duplicating all steps required. Since the second 
procedure requires minor modifications we will describe it briefly. 

On any Boolean algebra B we again have the concept of (finite) partition 
5 = 6(E) = (E,), of refinement: 5 < 6’ and of a monotone sequence of parti- 
tions {6,}, see “A,” p. 771. Every function F(Z) on B will be an additive set 
function of bounded variation the variation being defined as 


It is known’ that the space of these functions satisfies our properties 1) and 2) 
if we define F = G to mean that F(Z) = G(E) forall EZ « B. In order to obtain 
other properties we have to consider a field [B, vE]. In this case we say that 
F(E) is constant on E, if for all0 < FE < Ky 


F(E) F (Ep) 
(9) 


The common value of these quotients will again be called the value of F on Ey. 
A step function is one which has a constant value a, on each element £, of a 
suitable partition (Z,). Given A eB, the characteristic function w,(Z) is the 
step function which has the value 1 on A and the value 0 on B — A. Accord- 
ingly a step function can be written in the form 


(10) F= We, - 


Defining now the element F = 1 to be the characteristic function w, and putting 
MF = F(B) it is easily seen that the totality of step functions is a field [C, M]. 
There exists a smallest complete field [R, M] containing [C, M]. An element F 
a to R if corresponding to any ¢« > 0 there exist two step functions ®, 
such that 


(11) M(@-W) <e 


‘M. H. Stone, The theory of representations for Boolean algebras, Trans. Amer. Math. 
Soc., 40(1936), 36-111, esp. Theorem 67, p. 106. 
*S. Saks, Theory of the integral (Stechart) 1937, p. 10 and p. 61. 
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The class R corresponds to the class of Riemann integrable functions.’ The 
Darboux criterion for integrability is, assuming 6 = (E,), 

_ F(E") 
| 
A class Ry which lies between C and F is defined by the severer requirement 
F(E’) F(E&”) 


| 


However [Ry , M] is net a complete field. 
None of these fields is complete as a Banach space with the norm (8). A 
Banach space arises if we consider the space V,, 1 S p < © of all functions 


with the norm 
F(E, P\\/p 
lle = {lim 


and the space V,, with the norm 


(12) lim > sup 


-|E,| =0 (E’, €E,). 


(13) lim max =0 (E’, E” ¢E,). 


6 


| F(E) | 
and the subspace AC of V; consisting of functions which are absolutely con- 


tinuous relative to the measure vZ. In this connection, F(Z) is absolutely 
continuous if there exists an error function 7(e), lim n(e) = 0, such that 
e—0 


| P(E) | n(| £)). 
If F is any function and 6 = (E£,) is any partition, then the projection F; 


F(E,) 
| 


of F on 6 is again the step function which on EZ, has the constant value 


We again have the following theorem (see “A,” theorems 5-10): 
THEOREM 1. In the Banach spaces AC and V,, p > 1, the step functions are 
dense in norm. If {6,} is a monotone sequence of partitions then 


lim || Fs,, — Fs, || = 0 (m,n — 


In particular, if the space is separable it has a basis consisting of step functions. 
The conjugate space to AC is V,,. A sequence {F,} is weakly convergent in AC 
if the norms || F,|| are bounded and the functions F, are uniformly absolutely 
continuous and the limit of F,(E) exists for each E. 
The spaces V, and V,,p > i,q > 1, p = q/q — are conjugate. A sequence 
{F,}, F, € Vp is weakly convergent if || F, || < K and lim F,(E) exists for each E. 
Also if {F,} converges weakly to 0 then for a suitable subsequence F, : 


il1<ps2 
if p22. 


p=1 


(14) 


6 More precisely, this type of integral has been first defined in its general form by A. 
Kolmogoroff, Untersuchungen tiber den Integralbegriff, Math. Annalen, 103(1930), 654-696. 
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Finally we observe that the mapping process described in “A,” pp. 780-781 
can be applied to prove the following extension.’ 

TueorEM 1’. Theorem 1 with the possible exception of relation (14) also holds 
if the values of our functions are not numbers but elements of a Banach space X 
which has the property that every function f(t) from the interval 0 S t S 1 to X 
which has bounded variation is differentiable almost everywhere. 


III. Completion of fields 


THEeorEM 2. Corresponding to any field [C, M] there exists a complete field 
(C, M] which is the smallest complete field containing it. . 
We construct the space 7’ of sequences 


¢= {hj} = hi, }, 


v= {gc}, x = {hx} ete. with elements of C. We define: (i) ¢ = vif fi = 
for allk = ko and (ii) ag + by = {af + bg.}. This implies that g = y if and 
only if f, = gx for all k = ko, that 0 = {0}, and that (iii) sup (vy, y) = 
, gx) }, inf ¥) = inf (fi , The space 7 has the properties 1) and 2) 
and in addition the following property: If 


Gn = Ont, Vari 2 Gn Vn n=1,--- 


then there exists an element x for which 
Gm =xX2n- 


In order to prove the latter property we set up sequences gn = {fr },Wn = {gr}. 
Altering a finite number of elements f; first for n = 2, then for n = 3, then 
forn = 4 ete., we may assume that f; = fr” for all n and all k. Similarly 
we may assume that g;*’ = gi for all n and all k. Now, corresponding to 
each n there exists an index k(n) such that ff = gz fork = k(n); and it is easy 
to see that our property will hold if we put x = {hx} where 


he = fe for k < k(2) 
h. =fe for k(n) Sk <k(n+1). 


The original space C will be a part of 7' if we identify any f of C with the 
sequence {f}. 

Now let ge 7. We temporarily call ¢ integrable if corresponding to any 
¢ > 0 there exist elements f, g of C such that 


(15) Mif—-g)<e 


The common value of inf Mf and sup Mg will be again denoted by My. We 
claim that the totality of integrable elements has all properties of a complete 


"See S. Bochner and A. E. Taylor, Linear functionals on certain spaces of abstractly- 
talued functions, Annals of Math., 39(1938), 913-944; B. J. Pettis, Differentiation in Banach 
spaces, Duke Journal, 5(1939), 254-269. 
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field [Cy , M| except for the one deficiency that ¢ > 0 implies Mg = 0 and not 
necessarily Mg > 0. The reader will have no difficulty in proving the proper- 
ties concerning [Cy , MJ. As an illustration we will verify one of them, namely 
that if g, g: of 7’ belong to Cy, then so does sup (¢, ¢1). In fact if f => 9 > qg 
and f; 2 g: 2 gi then 


sup (f, fi) 2 sup (y, ¢1) 2 sup (g, 9) 
and our assertion follows from 
M (sup (f, fi) — sup (g, :)) = M sup (f — sup (g, 9), fi — sup (g, m:)) 
< M sup 9, f: — m) M(f 9) + — 9). 


In order to remove the deficiency mentioned before we apply a process of 
identification. Putting 2|¢| = 9 + sup (, 0), the totality of all elements 
g €Cy for which M(|¢|) = 0 are a subspace S with properties 1) and 2) and 
we define C as the class of residues of C) mod S. Thus denoting for a while 
elements of Cy by f, g etc., elements of S by ¢, ¥, --- ete. and elements of C 
corresponding to elements f, g, --- by f, 9, --- respectively, then f = ag + bh, 
f > 9, f = sup (g, h) shall mean that for appropriate elements ¢, y¥, x, f = 
= sup +¢,h+ ¥). Denoting 
the resulting field by [C, M] it is not hard to verify that it has all properties 
of a complete field. 

The reader will easily establish the following extention of Theorem 2. 

TuHrorEeM 2’. Theorem 1 also holds if the values of the functional Mf are not 
necessarily real numbers but elements of another partially ordered space which has 
properties 1) and 2) and in which every set of elements which is bounded from 
above (below) has a sup (inf). 

Another point worth mentioning is the following obvious 

REMARK TO THEOREM 2. If C has properties 1)-3) and property 4) is satisfied 
for two different functionals M and M’ and if the functionals M and M’ are 
absolutely continuous each with respect to the other then the corresponding 
smallest completions C are the same. 

The completion of Jordan fields could be proven directly in an analogous 
way.” However indirectly it is a consequence of theorem 2 and the following 
theorem 3. 


IV. Generation of Jordan Fields 


TuEorEM 3. Corresponding to any complete field {C, M] there exists a field 
[B, vE] which is also complete such that C belongs to class Ry on that field and 
Mf = f(B). 

Also if [C, M] is a field of which [C, M] is the completion then the set C is dense 
in the spaces AC and V,, p > 1. 


*H. M. MacNeille, Extension of measure, Proc. Nat. Acad. Science, 24(1938), 188- 193. 
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Proor. We consider all elements of C for which 


(16) 
and 
(17) inf @, 1 — w) = 0. 


They give rise to a Boolean algebra B if we interpret sup as addition and inf 
as multiplication. The correspondence between an element of C and an element 
E of B will be expressed by the notations wg , E,. The measure | Z, | = Mw 
makes B into a field [B, vE] whose characteristic functions can be readily identi- 
fied with the elements wg of C we started from. In order to prove the complete- 
ness of our Jordan field we have to prove that the relations 


We, Wm (m,n = 1, 2, ---) 
lim Mom = lim Mom 
inf @,,1—o,)=0, inf @r,1— =0 
imply (17). But this follows immediately from 
M inf ,1 = M (inf @, , 1 — w) — inf ,1 — wn)) S M — 
and 
0 < M inf @, 1 — < M inf @,, 1 —»). 
We next take a fixed element f of C and we introduce the elements 

(18) f(a) = inf (f, a) —-x<a<co 


The function f(a) is monotonely increasing (non-decreasing) with a, also 
f(b) — f(a) = inf (J, b) — inf (f, a) 
2 


and thus 
(20) 0s f(b, a) £1. 
Furthermore, for a = 0,6 = 0,a + 6 = 1, 


Kaa + 8b) — af(a) — Bf(b) = {inf (af, aa + Bb) — inf (af, aa)} 
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and thus f(a) is concave. In particular we conclude that for arbitrary a, 
h>0,k > 0,f(a — h, a) is monotonely decreasing as h decreasing, f(a, a + k) 
is monotonely increasing as k decreases, and 


(21) fla—h,a) 2 fla,a+k). 
Similar inequalities hold for the numerical functions 
g(a) = Mf(a), g(a, b) = Mf(a, b). 


In particular, since g(a) is concave its derivate exists at all but a countable 
number of exceptional values a. Thus, for non-exceptional values a 


lim g(a — h, a) = lim g(a, a + k). 
h-0 k—0 


By property 5) this implies the existence of a unique element g, for which 
(22) f(a — h, a) = ga = f(a,a +h). 


We are going to show that it is a characteristic function. Equation (16) 
follows immediately from (20) and (22). Furthermore we have, for h > 0, 


h-inf (f(a, a + h), 1 — f(a — h, a)) 
= inf {inf (f,a + h) — inf (f, a), h — inf (f, a) + inf (f, a — h)} 
} = —inf (f, a) + inf {inf (f, a + A), inf (f + h, a)} 
(f,a) + inf (f,a +h, f + h, a) 
| = —inf (f, a) + inf (f, a) = 0, 
and therefore 
0 S inf Ga, 1 — ga) S inf (f(a — h, a), 1 — fla + h, a)) 
= inf (f(a — h, a), 1 — f(a +h, a)) — inf (f(a,a + h), 1 — f(a—h, 
= 2 inf (f(a — h, a) — f(a,a + h)); 


these relations remain valid if we form the functional M for all terms. Doing 
so and letting h > 0, we obtain M inf (ga , 1 — ga) = 0 and thus ga also satisfies 
relation (17). 

The element of B which corresponds to ga will be denoted Aq. It is mono- 
tonely decreasing with a, and if (1) holds then 


eh: 


(23) wa4,=1 for a<p, wsa,=0 for g <a. 
We now take any finite set of non-exceptional numbers by < bi < --- < bn; { 
bo < p, bn > q, and in relation 
(b — f(b) — fla) (b — (a 


| 
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we put a = @,1, b = a, and we sum over v. Using relation (23) and the rela- 
tion f(a) = a, a < p, and writing E, = As,_, — A», we hence obtain 


and hence 


Since the maximal difference b, — b,1 can be made arbitrarily small, relations 
(24) and (25) show that f belongs to the class of functions Rp on [B, vE]. 

The proof of the last part of theorem 3 runs as in “A,” p. 775, bottom. 

The reader will easily establish the following extension. 

TurorEM 3’. Theorem 3 also holds if the values of the functional Mf (and 
the measure vE) are elements of another partially ordered space S which has proper- 
ties 1) and 2), in which every set which is bounded from above (below) has a sup 
(inf) and which satisfies the following condition: 

(i) Every concave function g(a), p S a S q, from real numbers to S has a 
derivative (in the sense of order) for a dense set of values a. 

It is interesting to consider the following modification of this condition. 

(ii) Every monotone function z = f(a) is continuous for a dense set of 
values a. 

(iii) The points a at which f(a) is discontinuous have a potency smaller than 
that of the continuum. 

(iv) The discontinuities of f(a) are at most denumerable. 

(v) The space S is complete in a Banach norm || z || for which 2; > x2 > 
> = inf (a1 , 22, --- ) > Oimplies || || > || ze || > --- || || > 0. 

Conditions (i) and (ii) are equivalent. This follows easily from the facts 
that for g(a) concave, the function 


h) — 
h 


(26) fla) = ing 2 + 
h>0 
is monotonely decreasing, that for f(a) decreasing the function 


(27) e(a) = + fla) da 


exists and is concave and that (26) implies (27). Conditions (iii) obviously 
implies (ii). However (ii) also implies (iii). In fact if (iii) does not hold then 
there exists a monotone function g(a) such that 


Dg(a) = inf (g(a + h) — g(a — h)) 
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is > O for values {a,}, the index v ranging over the continuum 0 < » < ], 
We now consider the functions 


0 for a<yp 


Dg(a,) for azv 


and at each point we define 


f(a) = sup f(a). 


It is not hard to decide that f(a) has the jump Dg(a,) at a = v and thus violates 
(i). Condition (iv) implies condition (iii), and under the continuum hypothesis 
it is equivalent to it. Finally (v) implies (iv) and thus all other conditions. 
In fact, by our assumptions, || f(a) || is increasing and || Df(a) || = D || f(a) ||. 
Thus Df(a) > 0 for a countable number of points only. 


PRINCETON UNIVERSITY. 
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ON THE MEAN ERGODIC THEOREM 
By L. W. Conen 
(Received May 23, 1939) 


The mean ergodic theorem of von Neumann’ has been extended to spaces 
L,(p = 1) by F. Riesz.” K. Yosida,* using a method similar to Riesz, obtains 
the following extension: If 7 is linear on a Banach space B to B, the iterates 1" 
of 1 are bounded and 


Lee = Ts, n=1,2,-:-, 
j=1 

is a weakly compact set, then there is an xo € B such that Txo = x9 and lim L,x = x. 
The theorem may be regarded as stating‘ that the sequence 7x is transformed 
by a regular matrix’ namely that of the Cesaro means of order 1, into a sequence 
L,a« which, if it is weakly compact, converges strongly to an element invariant 
under 7’. A natural problem is that of determining a class of regular matrices 
a,; such that if 


is weakly compact, then the sequence converges strongly to an 2 ¢B and 
Tx) = x. It will be shown that a sufficient condition for such a class is that 


lim | — = 0 
k j=k 


uniformly in n. In particular this is the case for the matrices of the Cesaro 
means of any positive order. 

We first establish the 

Lemma. If T and L,,n = 1, 2, --- , are linear transformations on a Banach 
space B to B such that TL, = L,T and, for some x, lim L,(« — Tx) = 0 and 
L,x converges weakly to xo , then Tao = Xo. 


‘J. von Neumann, ‘‘Proof of the Quasi-Ergodic Hypothesis,’ Proc. Nat. Acad. Sci. 
vol. 18(1932) p. 70. 

* F. Riesz, ‘“Some Mean Ergodic Theorems,’’ Jour. Lond. Math. Soc. vol. 13(1938) p. 274. 
Cf. E. Hopf, Ergodentheorie, Berlin, (1937) p. 23. 

*K. Yosida, ‘‘Mean Ergodic Theorem in Banach Space,”’ Proc. Imp. Acad. Tokyo vol. 
14(1938) p, 292. 

‘The writer is indebted to Prof. J. D. Tamarkin for acquainting him with Yosida’s 
note in his lectures and for discussing the general problem with him. 

*A matrix aj; is regular if for every convergent sequence of numbers é, lim, 
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Proor. Let @ be any linear functional on B. Then 


(1) lim @(Lnx — 2%) = 0 


from the definition of weak convergence. Since 7’ is linear on B to B 


(2) lim #(TL,2 — Tx) = 0. 


From lim L,(2 — Tx) = 0 and the continuity of @, 


(3) lim — = 0. 


Since 7' and L, commute, we may write for all n 
E(to — Tx) = — Lat) + — + — Tx). 


Thus from 1, 2, 3 it follows that @(a — To) = 0 for all linear functionals on B. 

eae ; TueoreMm. If 7 is linear on a Banach space B to B such that || T" || < 
i | A(T" = TT"), an; is a regular matrix such that 


lim | — @nj| = 0 
k jak 
uniformly in n and 


j 


j=1 


is a weakly compact set, then there is an xo ¢ B such that Tx = xp and lim Lyx = %. 
Proor. The Toeplitz conditions’ for a regular matrix a,; are 


(1) 2 |anj| $M, (n= 1,2,---); lima,; = 0, (j = 1,2, ---); lim = 1. 
n 
Since B is complete and for ze B 


t 
j=s 
the L, are defined on B. Further || Z, || < AM since 


t 


7= 
Also 
j=1 


O. Toeplitz, “Uber allgemeine lineare Mittelbildungen,”’ Prace. Mat. Fiz. vol. 22(1911) 
p. 113. 
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Now for anyze«B 


|| Lu(z — Tz) || = 


Tz + (Gn,i41 — Onj)T’ | 
j=1 


k-1 


for all n and k > 1. Given e > 0 there is, by the hypothesis of uniformity, 
a k, such 


ke 
2 
From 1 there is an n, such that 


k 
n>. 
7=1 
Thus 
(4) \| — Tz) || $ 3A |lz|le, n> 


Since L,x is weakly compact there is an 2% « B and a subsequence L,,; such 
that L,,2 converges weakly to x). Thus because of 3 and 4 the 7 and L,,; 
satisfy the conditions of the lemma and 


(5) = %. 


Let By be the closed linear manifold determined by the set of 2 — Tz for all 
zeB. We show that 7 — xe By. If not, there is a linear functional Z on B 
such that 


(6) Fou = 0, ue Bo; —2z)=1. 


Now — By, k = 0, 1, 2, --- , so that %(7"x — = 0. Since 
TxeBand 


— T’r) = — + — = — 


we have 


(7) dor = =1,2,---. 
From 4 and || Z, || < AM by 2, one sees that 
(8) limL,u=0, weBo. 


Since Z is linear we get from 1 and 7 


= anton = n=1,2,---, 


(9) lim = dx. 


=1, 
| 
“Rede 
he 
- 
n 
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Then from the weak convergence of L,,2 to x and 9 


0 = lim — Ln, x) = — lim Folin; = Toto — Fox = — z) 


contrary to 6. Hence x — re B. 
Now from 5, 7%) = %,j = 1, 2, --- and so 


Ln% = Tt = on) Xo, 
i= i= 


(10) lim = Xo 


because of 1. Writing x = 2 + (x — 2) we find from xz — 2 € Bo, 8 and 10 


lim L,x = 2 
: and the theorem is proved. 
The Cesaro matrix 
+j- 
j r>@, n= 1,2,-°-, 


| 
n 


satisfies the uniformity condition since a,; = 0 for 7 > n and 


n 
Jano] + | anita — 
j=0 


Coroutiary. If T is linear on B to B, || T” || S A and 


0, 

n—1 
by is weakly compact, then there is an x such that To = 2 and 
n—-m—1 


Ni 
ze. 
; : n 
| 
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Proor. Since the conditions of the theorem are satisfied there is an 2 «B 


such that 7% = and lim L,z = x. But = = 
= %. Thus 


|| — 2o|| = || T™Ln—m(@ — || S A || — x) || S Ae, 
n—-m>WN,. 


Brown UNIVERSITY. 
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This paper may be read without a knowledge of our first two papers on 


interpolation. 
Let 
7 
af”, as”, 


he oe | be a triangular matrix where for each n 


All the 2!” may be written in the form cos 3”; hence for each I we may 
define a triangular matrix 


with 
(4) os 


Let f(x) be defined in [—1, +1]; then we define the n Lagrange interpolatory 
polynomial of f(x) with respect to Mas the polynomial L,(f) of degree (n — 1) 
at most taking at the points 22", --- , the values f(ak”), faa”), 
f(x&). It may be verified that 


(2) 


1 Some of these results have been presented before the Mathematical Association in 
if Budapest in April, 1937. 
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wheré 
(5b) w(x) = Il = IT (x — 2,). 


We shall explicitly indicate the upper and double indices only when some 
misunderstanding may arise. The polynomials l,(x) (for which we omitted to 
indicate explicitly their dependence upon n) are independent of f(x) and 
dependent only upon MM, v, and n; following Fejér they are called the funda- 
mental functions of interpolation. For these it is easy to verify that 


(6a) L.(z) = L(z) = 1, n = 1,2, 
and more generally 
(6b) L,(r) = = r(x), n=k+1,k+2,.--. 


+1 
where r(x) denotes any polynomial of degree k. The numbers i L(x) dz = 


{ = \, (depending only upon 9M, v, and n) are called the Cotes numbers 
belonging to 2. From (6a) we evidently have 


(7) = = 2. 
y=] 
We intend to consider chiefly the case of two general and very often used 
matrices. The first of them is obtained as follows: let p(x) be nonnegative and 
integrable in Lebesgue’s sense (L-integrable) for [—1, +1]. Then a sequence 


of uniquely determined polynomials wo(x), w:(x), --- , corresponds to p(x) so 
that w,(z) is a polynomial of degree n with 
(8a) coeff. x” in w,(2) = 1 
and 
1 
(8b) dr = 0, n 
The sequence of such polynomials is called orthogonal with respect to the 
weight function p(x). The sequence of polynomials %(x), --- , Qa(a) «++ , for 
which 
1 
(8c) dx = 0 n m, 
1 
(8d) [ 2, (x)? p(x) dx = 1 n=0,1,++-, 
1 


(8e) coeff. x” in 2, (zx) is greater than 0, n=0,1,---, 


n 
tory 
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we call a sequence of normal-orthogonal polynomials with respect to p(z). 
The polynomials w,(z) and ©,(x) evidently differ only in a constant factor 
dependent only upon n. By (8b) it is easy to see that all roots of w,(zx) are 
real and situated in [—1, +1]. Taking these roots for n = 1, 2, --- we obtain 
the so called p-matrix. It is well known that for p(x) = 1 we obtain the 
sequence of Legendre-polynomials P,,(x), for p(x) = 1/-/(1 — 2”) the Tcheby- 
cheff-polynomials 7',(z), and in general, for p(z) = (1 — x)*(1 + 2)* with 
a > —1,8 > —1, the Jacobi-polynomials PS*(z), 

The eae prin of matrices has been found by Fejér’ in his paper about 
Lagrange-interpolation. According to his notation the matrix I is normal, if 


(9a) = 1 — (2 — 0, 
wn (Le) 


-ls2zs +l, k = 1,2,---n, n=1,2,.--, 
and it is strongly normal, if 


(9b) 1 — Da, 
wn (tx) 


—-ls2z8 +1, k=1,2,---n, n=1,2,.---, 


where c,—and later all the other c’s—are positive constants independent of 
x,n,k. Their dependence upon accidental parameters will always be explicitly 
| sated. Fejér proved that e.g. the sequence of Jacobi-polynomials P{*'”(z) 
Ooh Hee presents a normal matrix if -1 < a < 0, —1 < 6 S 0 and a strongly normal 
one, if <<a <0,—1<£8<0. For this second matrix class, by 


(10) > = 1, 
k=l 

we have 


k=1,2,.--n, n=1,2,--- 


Orthogonal polynomials, and especially Jacobi-polynomials, play a most im- 
portant part in many problems of analysis; we mention here only the works of 
Legendre, Laplace, Jacobi, Bruns Tchebycheff, A. Markoff, Stieltjes, Christoffel, 
Darboux, Fejér, 8. Bernstein and Szegé. In the general theory of orthogonal 
polynomials (i.e. for general p(x)) an important step has been made by G. Szegd.. 
He succeeded in proving for a general class of weight-functions the asymptotic 
formulae of Laplace-Darboux concerning Jacobi-polynomials. Thus he proved 


2L. Fejér: Lagrangesche interpolation und die zugehérigen konjugierten Punkte, Math. 

Ann., 1932, pp. 1-55. 
1G. Szegé: Uber die Entwicklung einer analytischen Function usw., Math. Ann., 1921, 

pp. 188-212. ( 
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that if p(x) is such that to p (cos 9) | sin d = p,(#) there exists a function D(z) 
regular in | z| < 1, here # 0 and for almost all 3 ~ 
(12) lim | D(re’’) = pi(d), 


r—1—0 


then in|z| = R > 1 forn— uniformly 


ow (2n)'D () 


which determines the asymptotic behavior of the polynomials for any point of 
the z plane not lying in[—1, +1]. (12) is satisfied, if p(x) = 0, further if p(x) 
and log p(x) are Lebesgue-integrable in [—1, +1]. Further—and this is a 
deeper result—Szego* gave for the fundamental interval itself i.e. for [—1, +1] 
an asymptotic formula 


2 


where a depends in a given way upon p(x). In order to give a simple example, 
he proved this for e S % S m — e and n— o, if p(x) remains in [—1, +1] 
between two positive bounds and the first and second derivatives of p(x) in 
the same interval exist. §. Bernstein’ proved a theorem, which is analogous 
to the above mentioned theorem of Szegé. He proved the asymptotic formula 
(13) if, in [—1, +1] p(x) (1 — 2”) remains between two positive bounds and 
uniformly satisfies here a logarithmic Lipschitz condition with the exponent 
1+. For this theorem, Szegé gives a very simple proof in his book to be 
published. The papers of J. Shohat* also contain general results of this kind. 

The problems concerning orthogonal polynomials can be divided into four 
classes: a) the behavior of the polynomials within the interval [—1, +1] 
((tnternal behavior), b) the behavior of the polynomials upon the plane cut along 
[—1, +1] (external behavior), c) distance of consecutive roots (problems of the 
jiner distribution of roots), d) number of roots in a fixed subinterval (problems 
of the mean-distribution of roots). Problems concerning a) are completely 
solved by Szegé and Bernstein for a rather general class of weight-functions; 
if we require the weight function only to satisfy 


(I4a) p(x) = ee -ls2zs8 +l, 
(14b) p(x) is Lebesgue-integrable in [—1, +1], 


*G. Szegé: Uber den asymptotischen Ausdruck von Polynomen, die durch eine Ortho- 
gondlitatseigenschaft definiert sind, Math. Ann., 1922, pp. 114-139. 
_°S. Bernstein: Sur les polynomes orthogonaux on a segment fini, Journal de Mathema- 
tiques, pp. 127-177. 

*See J. Shohat: Théorie générale des polynomes orthogonauz de Tchebichef, Mémorial 
des Sciences Mathématiques, Fase. LXVIII. 
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then Shohat® gives an upper estimate for the orthogonal polynomials belonging 
to p(x). As far as we know there are no other general results in this direction, 
Question b) is settled by Szeg6 for rather general weights. As for c) we obtain 
from Szeg6’s formula that if the weight function throughout a subinterval has 
derivatives of the first and second order and remains throughout [—1, +1] 
between two positive bounds (Szegé gives some other, more general condition), 
then for the n‘* fundamental points zi". = cos 3 ‘ lying in that subinterval, 
we have lim n(#{?} — 3{”) = 2; we obtain the same result for consecutive 


i fundamental points from Bernstein’s theorem, if throughout the interval 


(15a) cs = (1 — 2’) Se 
and if for p(cos #) sin 8 = t(8) throughout and uniformly in (0, 7] 


(15b) + h) — t()| < 1 


lo 
Concerning d) Szegé’ implicitly proved, that, if the weights are non-negative in 
[—1, +1], and if in the same interval p(x) and 
Ae | grable, then the distribution of the n* fundamental points is uniform, which 
means, that if 0 < a < then with 2§” = cos 


log p(x) are L-inte- 


(16) lim aso” <p. 

i 

Szegé’s and Bernstein’s methods are based upon asymptotic formulae for 
polynomials. But it is probable that in the general case such a formula does 
not exist not even for continuous weights remaining between two positive 
bounds. Thus, in this way we cannot obtain any answer to questions such as 
e.g. what is the effect of the singularities of the weight function (loci of dis- 
continuity, infinities, zeros) upon the distribution of roots, whether this effect 
is only local ete. The investigation of this last question will be a main object 
of our paper. Here we make use of a principle introduced by Fejér: we derive 
the structure of the matrix from the properties of interpolatory fundamental 
functions belonging to Yt. Fejér deals with two such properties. The first” 
is the property of being strongly normal, from which he deduces the relation 
(17) lim max — 2) =0, 


v=1,2,-++(n—1) 


t 4 which—from what precedes—means a statement about the distribution of roots 
of certain Jacobi-polynomials. The second property’ is the non-negativeness 


7L. Fejér: Mechanische Quadraturen mit positiven Cotesschen Zahlen, Math. Zeitschr., 
1933, pp. 287-310. His proof gives also the following result: if there exists for the matrix 


no 
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of the Cotes-numbers belonging to It, from which we once more obtain (17). 
Theorems deducing properties of 9 from some interpolatory properties we shall 
call Fejérian theorems. We proved® two such theorems, the application of 
which to p-matrices gave the following two theorems. 

I. If throughout [—1, +1] ¢ S p(x) S cz and p(x) is L-integrable, then ' 


¥=0,1,2,---(n—1),n, n=1,2,---, 8 =0, 
for the corresponding matrix IN’. (See (3).) 
Il. If throughout the interval [—1, +1] p(z) = 0, p(x) and sts) L-integrable, 
then 


< (n + 1) 


(n) (n) 
v n 


v=0,1,---n. n=1,2,---. 


By systematic application of Fejér’s principle we obtain Fejérian theorems 
for each of the four classes mentioned above, theorems, which may be applied 
to p-matrices as well as to strongly normal ones. Properly speaking we deduce 
the theory of both classes of polynomials from that of a more general class of 
polynomials, the roots of which form a matrix JN, and for which the values of 
the fundamental functions 1,(z) satisfy certain conditions. 

In §2 we consider problem a). That will be the only section in which we 
shall not explicitly express a Fejérian-theorem. Our theorem I asserts for 
strongly normal polynomials 


8 Vn 
in| +1, Hin 
lon(z) | +l n= 1,2, 
where ¢, is any constant for which (11) is valid. This result cannot be essen- 
tially improved in [—1, +1], but it is probable, that in [—1 + ¢, 1 — e] the 
factor with 1/n can be omitted and the factor 8/+/c, replaced by a cy = 
Co(:, €). Theorem II applies to the orthogonal polynomials and it states that, 


Wa function s(x), non-negative and L-integrable on [—1, +1], positive in [a, 6] and such 
that 


1 
1 


then (17) holds in [a, 6]. This is satisfied e.g. if the matrix M is a p-matrix and s(x) = p(z); 
hence the roots of the polynomials orthogonal to a weight-function, non-negative and 
L-integrable on [—1, +1] and positive throughout the subinterval [a, 6] cover the interval 
(a, b] everywhere densely. 

*P. Erdés and P. Turén: On Interpolation II, Annals of Math. 1938, pp. 703-724. 
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if the weight-function is non-negative and L-integrable in [—1, +1] and if 
throughout the subinterval [a, b] p(z) 2 m > 0, then 


(18a) jot) <| in aS2sb,n=1,2,..., 


as) <| [ p(w 
in 


For the case a = —1, b = +1 these estimations have been presented by Shohat. 
By the same method we obtain lower estimates for the orthogonal polynomials 
wn(z). More exactly: if the weight p(x) is non-negative and L-integrable 
throughout [—1, +1] and p(x) = m > 0 throughout a subinterval [a, 5], then 
for any x (real or complex) 


4 n 
(19a) |wn(x) | = | en = 1 ——]) |x - 


(n) 


where xz," denotes the n* fundamental point nearest to z. As a matter of 
fact this has importance only for the interval [—1, +1]. If in addition to the 
above properties throughout a subinterval [c, d] of [a, b] the weight is bounded, 
p(x) S M, then a factor with »/n can be appended to the right side, if we take 
C1 =¢yn(M). Fore = a= —1,d =b = +1 we find implicitly and qualitatively 
the same as Shohat.’ By this and by the results of §3 we obtain e.g. that if 
for the L-integrable weight function p(x) = m > 0 throughout [—1, +1] and 
if p(x) < M throughout the subinterval [e, f], then w,(x) takes in any [2S , 2+”) 
lying in [e + e, f — ¢] a value, greater than 


M, Mm, @, f) 
2"4/n 


(19b) 


It is probable, that in (18a) and (18b) the factor n or »/n may be improved to 

C13(a, b, m)+/n or to cu(e, a, b, m) respectively—this is true in the mean—and 

also in (19b) we may omit from the denominator the factor with /n. If in 

[—1, +1] p(x) = m > Oand in the subinterval [a, b] p(x) < M, then we proved 

that there exists an n(n) such that n(n) > 0 for n— © and that in [a + «,b — 4 

vn 
Qn 


| wn(z) | < Cis(a, b, €é, M 


We omit the details of the proof. 


® See footnote 6, p. 41, formula (60). 
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In §3 we are concerned with b) problems. The base of the investigation is 
the following Fejérian theorem: If for the matrix 2 for every « > 0 


20 
n > Cr6(€), k =1,2,---n, -lszs +l, 


then for any fixed z of the complex plane cut along [—1, +1] we have 


(21) lim [con(z) z+ 1) 


where we are to take those values of the n** and square roots, which are positive 
on the positive real axis. Condition (20) is abundantly satisfied for sequences 
of strongly normal polynomials. Thus the asymptotic representation (21) 
applies for these too. 

We shall see that if in [—1, +1] the L-integrable p(x) is 20 and its roots form 
an aggregate of measure 0, then (20) is satisfied hence (21) holds too. Formula 
(21) presents less than the above quoted formula of Szegé but it refers to a 
wider class of weight-functions: e.g. (21) holds for the weight-function p(xr) = 
c !** whereas Szegé’s formula has nothing to say in this case. 

We shall give a direct and elementary proof of the aforesaid theorem, but we 
are bound to mention that it is to be deduced indirectly from a deep theorem of 
L. Kalmar” by the following note of Polya:" If upon the matrix I we have 
uniformly in [—1, +1] 


lim | + ++» + |Un(a) = 1, 


then the Lagrange parabolas taken upon M of a function f(x) analytic in this 
interval uniformly converge to f(x). In order to prove this note standard 
theorems about approximation of analytic functions are required. 

On the other hand by a further theorem of Kalmar” it follows, that the ele- 


L. Kalmar: Az interpolétiérél, Mathematikai és Fizikai Lapok, 1927, pp. 120-149 
(Hungarian). This gives the following result: Let 8 be the closed interior of Jordan-curve 
!on the complex z-plane and let x = ¢(z) be regular on the exterior of / and continuous 


on the closed exterior of 1, which maps % upon the exterior of a circle | z| S ¢ with lim 
|z| 


n 
>. 1. Let the matrix Mt be given in B and w,(z) = Il (z —2\™). Then a necessary 
v=1 


and sufficient condition that lim La(f) = f(z) uniformly in % for any f(z) regular in % is 
that jim [wn(z)]/" = 9(z) for any z of the exterior of 1. We use this only in the case if 1 is 


the interval [—1, +1]. 
ian Polya: Uber die Konvergenz von Quadraturverfahren, Math. Zeitschrift, 1933, pp. 
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ments of IN’ belonging to M are uniformly distributed in [0, z]."” From this we 
incidentally obtained the following result: if p(x) is non-negative and L-inte- 
grable in [—1, +1], and further, if the roots of this p(x) form an aggregate of 
measure 0, then the elements of the matrix Dt’ belonging to the roots of the 
respective orthogonal polynomials are uniformly distributed. (We can prove 
this result in a direct and elementary way, too.) 

In §4 we consider c) problems. The basis of the general consideration is 
given by the following Fejérian-theorem: If a matrix Mt’ is such, that for a 
subinterval [a, 6] of [0, 7] with 


HI <as KH” <I <... <9” < of} 
we have 
S$ K, k=v,v+1,---n, a@=cosB cosa=b, 


and, in the same subinterval, the absolute value of the other n™ fundamental 
functions does not exceed cn“!’, then 
€, a, 0) K 


4 

if and are in [a + ¢, — If [a, = [0, =], then cs is independent 
of ¢ and the estimate holds for all [8{”, d4f1] (k = 1, 2, --- ,m — 1), the upper 
estimation holds even, as we proved’ for k = 0 and k = n, if aK” = 0, OW, = 2. 
The content of the theorem may briefly be expressed as follows: if the funda- 
mental functions belonging to the fundamental points of a subinterval are 
bounded and the other fundamental functions are in the same subinterval 
not excessively great, then the distribution of the matrix is approximately uni- 
form in that subinterval. In our paper cited under’ we already proved, that 
the estimate of the form (22) holds in the case of strongly normal polynomials 
for any pair [8%”, d¢2}] with an absolute constant cs. For orthogonal poly- 
nomials we obtain that, if the L-integrable weight function is non-negative in 
[—1, +1] and if 0 < m S p(x) S M ina subinterval [cos 8, cos a], then for 


any pair in [a + — we have 


(22) 


Cig(m, M, a, B, < of” < Cao(m, M, a, B, €) 
n = Vk+1 k = n . 


For [a, 8] = [0, =] cio and cy are independent of €; in our paper we proved the 
upper estimate for this case, we omit the details of the lower estimate. 
If in the subinterval [cos 8, cos a] p(x)/(1 — 2°”) = m > 0 and besides it 


2 This means of course, that for any fixed subinterval [a, 8] of (0, x] 
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p(x)V (1 — a’) is continuous, very much more is to be said from the nt* funda- 
mental-points situated in [a + ¢«, 8 — ¢]. In this case we have for n — 


— ~ 


The proof is also based upon the analysis of interpolatory forms but not upon 
a Fejérian-theorem; it is important to notice that the formula obtained for the 
fundamental functions relative to [a + «, 8 — «] is to some extent an asymptotic 
one. The interval [a + 8 — may be replaced by E ,B- 
where A(n), though arbitrarily slowly, tends to infinity, and we nan postulate 
other, more general conditions for the weight function. In the case of strongly 
normal polynomials the former of us proved in another way, that for 


) sy” the difference 3,3} — of” We do not 


sie the details of the aie 


In §5 we consider d) problems. The analysis is based upon two Fejérian- 
theorems. The first of them states, that the uniform distribution in the sense 
(16) of the matrix J is a consequence of condition (20); we give for this a 
completely elementary direct proof. If for a matrix 9% with the absolute 
constant K’ 


L(x) | < K’, -ls2zs +l, y=1,2,.--n 


then more exactly © 


(23) —cx(K’,e){(8 — a)n}* < < {(B — a)n}*** 
<p 


for (8 — a)n > Co3(K’, €). This means, that for uniformly bounded funda- 
mental functions the uniform distribution is already effected for very small 
subintervals [a, 6], the size of which depend upon n. If [a, 8] means any 
interval in [0, ], then by the condition 


(24) | L(x) | +1, yv=1,2,---n, n=1,2,--- 


we have 


(25) | > 1- B — < C26(c2s, Cr, €)n***, 


Which establishes the uniform distribution already for intervals of the length 
1/n'**, This is not very much weaker than the former conclusion. 

By applying the above-arguments to sequences of strongly normal poly- 
homials we immediately see that the fundamental points are distributed accord- 
ing to (23). Thus for orthogonal polynomials we obtained a new and strictly 
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elementary proof of our theorem that, if the L-integrable p(x) weight-function 
is in [—1, +1] not less than 0 and the aggregate of the points x with p(x) = 0, 
is of measure 0, then the distribution of the elements of the matrix 2’ formed 
of the roots of the respective orthogonal polynomials is uniformly dense in (0, 7]. 
Here we must remark, that although our hypothesis is more general than that 
of Szegé, we obtained only the sufficient condition for the uniform distribution 
of the roots; the necessary and sufficient condition—as the first of us proved— 
is connected with the transfinite diameter of the aggregate of points, for which 
p(x) = 0. We omit the proof here. 

Our second theorem states that, if the L-integrable weight function is =m > 0 
in [—1, +1], then (25) holds for the corresponding matrix J’; if in addition, 
for [—1, +1] M = p(x)+/(1 — 2°) = m, then (23) holds too. If the aforesaid 
conditions are valid only for a subinterval and for the complementary sub- 
interval of [—1, +1] we postulate only the non-negativeness and the L-inte- 
grability, then nothing may be said with respect to the d) problems. 

From the point of view of the theory of uniform distribution we make fol- 


lowing remarks. Weyl’s criterion for the uniform distribution of 2 under (1) 


: n 
postulates, that for n — © the expressions s, = Dr en" tend to 0 for any 


positive integer k. Our theorems of §5 deduce the uniform distribution from 
the behavior of certain polynomials associated with Mt. It is to be noticed 
that instead of asymptotic equalities we have in the condition only inequalities 
and that we obtain also an error-term, that could not be obtained by Weyl’s 
criterion. It would be plausible to ask, whether the uniform distribution with 


error-term is to be deduced from an inequality relative—in [—1, +1]—to 


wn(x) itself. The answer is affirmative; if in [—1, +1] |,(x)| S = with 


A(n) = 2, then for a fixed subinterval [a, B] we have 


< 8[nlog A(n)]'. 


v 


We will return to this problem on another occasion. If we disregard the error- 
term then, as we learned later the theorem is contained in a general theorem 
of Fekete” stating that the distribution of a matrix I* given upon any Jordan- 
curve l is uniform, if upon 1 the inequality [| w,(zx) |” < M holds, where M 
denotes the transfinite diameter of the Jordan-curve. Our argument essen- 
tially differs from his method. 

From what is said before the reader may see the chief results of this paper: 
the uniformity of the method, the statement that the polynomials and their 
roots essentially depend only upon the local values of the weight function and 
asymptotic formulae of more general validity than before. We hope to con- 
sider the other fundamental problems in another paper. 


18 Oral communication. 
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TurorEeM I. For strongly normal matrices we have in |—1, +1] 
8 Vn 


Proor. As the arithmetic mean is not less than the geometric mean we 
may write 


n n 


v=1 


As the 2,’s are in [—1, +1], we have after Schur™ 


n n+} 
IT | | < 


i.e. 
2—2/n 1 Cogn” 
| wn(x) | < cn Q2n 
hence 
| wa(x) | < Qed. 


This proof is very simple, but Schur’s theorem which we applied is not of 
interpolatory nature. Hence it will perhaps be of some interest to give another 
proof for it. We require 

Lemma l. If1 > > 4” >... > = —1, then 

= 1 n—2 
—+j~—— 22°. 
vel | wn (xy) | 
1 sin (k + 1)0 
(Equality only for w,(x) = (a? — 1)U,-2(x), where U; (cos 8) = 5 
but for the present we shall not use this.) 

Proor. Let us fix in [—1, +1] the values & > & > --- >£, and let us deter- 
mine the polynomial f(x) of degree (n — 1), for which coeff. x” * = 1 and 

max |/f(é)|is minimum. According to standard theorems such f(x) exists 


and takes at the places ¢ with alternating signs the same absolute values 
(v= 1,2,.--,m). Thus by coeff. z” ' = 1 we have 
yes] w’ (é,) 
f(x) n (—1)"" 
yal w’ (é,) 


“I. Schur: Uber die Verteilung der Wurzeln bei gewissen algebraischen Gleichungen mil 
ganzzahligen Koefizienten, Math. Zeitschrift, 1918, pp. 377-402. 
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where w(z) = 7. (x — &). The minimum value is given by the formula 
1 


M ‘on — 
n n 1 
i.e. 
rai | | 


since for f(z) = Tra(x)(Tr+ (cos #) = 


cos (n — 1)8), in [—1, +1] 


By taking the reciprocals we obtain the Lemma. 


1 
Qr- 
. By the Lemma we immediately obtain that 


max | T,-1(x) | = 


=2 s2vn[ s s vn, 


which establishes the theorem. Notice that in both proofs we used only the 
fact that 

This result is not to be improved essentially in [—1, +1], that is to be seen 
by the matrix given by the roots of the Jacobi-polynomial PS ‘~°(zx) for « 
being any small fixed positive numbers. Its being strongly normal we already 
mentioned in the introduction. On the other hand by 


n 


PR? (1) ~ cule) 


a>-Il1p>-l 


we have 


THEOREM II. If the L-integrable weight-function p(x) is non-negative in 
[—1, +1], and for the subinterval [a, b] = m(> 0), then in [a, b] 


whereas in [a + b — 


i 
{ 
t 
i 
4 


mM 
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Proor. As is known—and it may easily be verified  @,(2) minimizes the 


integral J (f) 
with coeff. x” 


1 
r f(t)’p(t) dt, if f(t) runs over the polynomials of degree n 


1. Thus fora S zx S b we have 


where 7’, (cos 8) = ns , cos nd. Hence 


(26) m [ dt — Lf p(t) dt. 


But then, according to a theorem of A. Markoff (stating that if fora S x S b 


| F(x) | < M, then here | F’(x) | are n*, where n denotes the degree of F(x)) 
fora < x S b we have 


+ 
By applying to (26) the theorem of Bernstein-Fejér (stating that if for 
Mn 


the degree of F(x)) we obtain fora 


4 1 2n +1 


In connection with theorem II we mentioned that it is probable that the 
factor \/n in (18b) is to be improved to cu(e, a, b, m). This conjecture may to 
some extent be supported by the fact that from (26) 


(27) [ < E [ p(t) a], 


ie. for [a, b] the mean value of | w,(t) | is O J. 


The proof of theorem II is very simple, but it is not of interpolatory char- 
acter; thus we give a proof of such kind which with a slight modification gives 
the lower estimate indicated in the introduction, and besides it contains many 
elements needed in the following investigations. 

Let the numbers 


1 
(28) = { dt = ky, 2, 


denote the Christoffel-numbers belonging to p(x), then we have 


= 
1 
, +1) 
vn, 
y the 
seen 
for 
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Lemma II. Jn [—1, +1] — pi(x) 2 po(x) = O, both L-integrable. If 
L(x) (v = 1,2,---,n;n = 1, 2, --- ) stand for the fundamental functions and k, 
for the Christoffe to p(x), UF (x) and kt for those belonging to 
po(x) respectively, then for any fixed (real or complex) xo 


y=] p=1 


Proor. Let 2% denote any fixed number and determine the polynomial 
F(x) of degree (n — 1) at the utmost, for which F(a) = 1 and I(F) = 
1 


[ | dt is minimum. 
1 


We express F(x) by the interpolatory polynomials belonging to the roots of 
polynomial orthogonal to p:(x), then we have 


F(z) = 
i.e. 
| I(F) = > d, d L,(x)l,(x)pi(x) dx = |d, ’k,, 


p=1 


| as” for ¥ v 


1 1 
if | (29a) 1, (t)1,(t)pi(t) dt a n(a)pr(ax) dx = 0, 
if and 
As 


n 


F(a) = 1 = d,l, (a0), 


| l,(a0) hy |d, 


1 16 
| 


y=] ky 


we obtain from what 0 


| 


i.e. 


I(F) 2 


1° P. Erdés and P. Turén: On Interpolation. I, Annals of Math., 1937, pp. 142-155. 
* Implicitly J. Shohat: Théorie générale etc., p. 47, formula (75). 


y=] 
ul = 
| 
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Equality is evidently to be obtained if and only if 


F(z) = 
y=] k, 
i.e. 
But then 
> [ \F@ ae | 
(31) y=] v F F (x) =ag+ ++ 12" 


In the special case of xo an the v root of the n™ polynomial orthogonal to 
p(x), then by (30) the minimum-value is k, and this minimum is attained only 
for F(x) = 1,(x) (Corollary I).” 

Here wi we remark—although we make no use of it in this paper—that the sum 

derivative. 

In Lemma II let p;(xz) = p(x) and po(x) = m(>0) ifa S x S b, and p,(x) = 
for the complementary intervals; furthermore suppose 2 real. Then the 
explicit form of the polynomials orthogonal in [—1, +1] with respect to po(x) 


is given by 
(- 


where P,,(x) denotes the n** Legendre-polynomial for [—1, +1], with the nor- 
malization P,(1) = 1, A depending only upon 1, a, b so that coeff. x" in w,(x) 
equals 1. As in this case with P,(,) = 0 (v = 1, 2, --- n), 


kt = It (t) dt = 
nf (t) [ (1 Palm)” 


from Lemma II, if &, &, --- , : denote the roots of P, (-1 + az—*) =U, 


is also monotone with respect to p,(x), if If?(2) denotes the r* 


b-—a d 


i I Shohat: On the convergence-properties of Lagrange-interpolation etc., Annals of 
Math., 1937, pp. 758-769, formula (39). 
"LL, Fejér: Az interpolatiérél, Akadémiai Ertesité (Hungarian), 1915. 
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As —1 + 2 eet. m (v = 1, 2,--- mn), making use of the differential 
equation of the Legendre-polynomials we obtain by the substitution x = 
a+ — (1 + yo) in (32) for any real yo 


2 


Equality in (33) holds only for p(z) = m. Suppose now —1 S yw < +41: 


then, by a well known result | P,(yo) | S< 1, and by the above cited theorem of 
Bernstein-Fejér | P’,(yo)/(1 — yo) | S n, ie. from (33) for a < x < b we obtain 


Let now Sy S1-—e. Then according to the classical formula of 
Laplace for < 3 — we have 


by this and by the theorem of Bernstein-Fejér for [—1 + «, 1 — «] we have 


i.e. we obtain roughly” from (33) in [a + ¢ b — ¢] 


1, (x)* Ca3(e) 
34b 
=k 
We remark, that for the validity of (34a) and (34b) in the above intervals 
we require only that the L-integrable p(z) is in [—1, +1] not less than 0, and 
in [a, b] p(x) 2 m > 0. (Corollary II.) 
Let in Lemma II p,(x) = p(x), pe(x) = 0 in [—1, a][b, 1] and not less than 


m 
[a, b]; if v1, y2, --» Yn stand for the roots of the Tchebycheff 
polynomial 7',(x)(T, (cos 3) = cos nd), ue, for the roots of 


b-—a 


by (t) dt 


a= mf 
a [(¢ — a)(b — 1 Ta(y)(t — VC n 


1° Fora = —1, b = 1, see J. Shohat: On Interpolation, Annals of Math., 1933. 
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i.e. by easy computation with real z 


s * 


v=1 W 


1 2(x — a) 


Thus we came to the result, that if the L-integrable p(x) is not less in [—1, +1] 


(34¢) 


m 
than 0 and in the subinterval [a, b] p(x) = then fora SxS b 


> 


v=] k, 


(35) 


(Corollary III). Equality holds only when in [a, 6] p(x) = i. a =o 


and in the complementary intervals p(x) = 0, further x = aor z = b. 
We deduce theorem II from (34a) and (34b) as follows. As 


and as ki” > 0 by (6a) and (29b), we have in [a, b] by ve 


v=1 


and analogously in [a + ¢, b — ¢] 


(Simi) 


a] 


It is to be remarked, that the same argument leads to the following result: 
if the L-integrable p(x) is not less than 0 in [—1, +1] and if in the subinterval 
[a, bj 


But then 


m 


then fora < x < b we have 


Some further corollaries of Lemma II we shall mention later. 


y | 
+1; 
= 
" 
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4+ = (v = 1, 2,---), making use of the differential 


equation of the Legendre-polynomials we obtain by the substitution 2 = 
we b-—a 


(1 + yo) in (32) for any real yo 


2 
2 
(33) yo) Pn (yo)” + n? 


Equality in (33) holds only for p(z) = m. Suppose now —1 S yw S< +1; 


then, by a well known result | P,(yo) | < 1, and by the above cited theorem of 
Bernstein-Fejér | P’,(yo)/(1 — ys) | S n, i.e. from (33) for a < x < b we obtain 


n 2 19 


Let now Sy Then according to the classical formula of 
Laplace for S 3 S — we have 


by this and by the theorem of Bernstein-Fejér for [—1 + ¢, 1 — «] we have 


P,(x) | < | PL (x) | 


i.e. we obtain roughly” from (33) in [a + ¢, b — ¢] 


(34b) < n. 


We remark, that for the validity of (34a) and (34b) in the above intervals 
we require only that the L-integrable p(x) is in [—1, +1] not less than 0, and 
in [a, b] p(x) 2 m > 0. (Corollary II.) 

Let in Lemma II p,(x) = p(x), po(x) = 0 in [—1, al][b, 1] and not less than 


m 
[(e — a)(b — 2)! ifn, 2, --- stand for the roots of the Tchebycheff 
polynomial 7',(x)(T, (cos = cos nd), we, for the roots of 


+ — then 


1? Fora = —1, b = 1, see J. Shohat: On Interpolation, Annals of Math., 1933. 
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i.e. by easy computation with real z 


(34 y=] k, W ysl 
Cc 


Thus we came to the result, that if the in ce, p(x) is not less in [—1, +1] 


than 0 and in the subinterval [a, b] = = then fora S23 b 


> L(x)” < 


y=] k, ™m 


(35) 


(Corollary III). Equality holds only when in [a, 6] p(x) = lo = aa = Pye 


and in the complementary intervals p(x) = 0, further x = aor z = b. 
We deduce theorem II from (34a) and (34b) as follows. As 


=f (Luo) vow = roa 
and as k{” > 0 by (6a) and (29b), we have in [a, b] by (34a) 


(= | 1, (ar) vk.) s k, < m(b a) p(t) dt-n 


and analogously in [a + ¢, b — ¢| 


(= | < ate [ p(t) dt-n. 


p(t at Vn fora+eSuSb—e. 


It is to be remarked, that the same argument leads to the following result: 
if the L-integrable p(x) is not less than 0 in [—1, +1] and if in the subinterval 
[a, 


But then 


2"? < 2 


then fora < x < b we have 
1 
) dt Qn n = 1,2, 


Some further corollaries of Lemma II we shall mention later. 
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Let us now consider the lower estimate of the orthogonal polynomials w,(z). 
TuroreM III. Let the weight-function p(x) be non-negative and L-integrable 

in [—1, +1]; throughout the subinterval [a, b] suppose p(x) 2 m > 0: Then, if 
xs” denotes the root of w,(x) nearest to x, we have for real x 


34 


We require two lemmas. 

Lemma III. In [—1, +1] suppose p(x) 2 po(x) 2 0 and both L-integrable. 
If w(x) and ws (2) pan the corresponding orthogonal polynomials respectively, k, 
and ki the respective Christoffel-numbers, x, and x; the respective fundamental 
points, then 


1 
= ky wt (at)? 


v=1 


1 
Proor. Let us consider the minimum of N(F) = [ | F(t) |’px(t) at 
1 


1 


amongst the polynomials of degree (n — 1), in which coeff. c"~ = 1. It is 
known that this problem has one and only one solution and that the minimum 
is assumed only for F(x) = wr_i(x). But we want to represent the solution in 
the form F(z) = >>? d,l,(x), where the 1,(x)’s denote the fundamental fune- 
tions of Lagrange-interpolation formed upon the roots of wn(x). It is evident 
that i in rm case we have to determine the minimum of the form >}. k, | d, |’, 
if > 7,7, = 1. From this, by applying Schwarz’s inequality once more, 


v=l1 


n 1 
we obtain for the value of this minimum [= ies and equality holds 


only for the polynomial 


1 
(37) f(x) _ 
(z,)? 
Thus 
1 


If p:(x) = p(x) and throughout the intervals [—1, a], [b, 1] p2(x) = 0, further 
fora S$ x S b po(x) = m, then 


Linea) min [ dt ’ 


| 
| 
Bai, 
| | 
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where the integrand is the linear transform of the (n — 1) Legendre poly- 
nomial with the normalization coeff. z""" = 1. Thus 


2n — 2\’ 
(38) vel m (b—a)?"-" 


if throughout [—1, +1] the L-integrable p(x) = 0 and throughout [a, b] p(x) = 
m > 0. Equality holds only if p(x) = m in [a, b] and = 0 outside of [a, }]. 

Lemma IV. For the fundamental functions of the Lagrange-interpolation 
formed upon any matriz IM we have 


= len(z) + 2 1. 
a” = 


Proor. Let 2 <k<n-—2. Then ¢(z) is a polynomial of degree (n — 1) 
at the utmost, vanishing at 2{”, , af"), 28”, ie. at nm — 2 
places and equals 1 at ap” and zh3}. Consequently its first derivative has one 
root in each of the intervals [x$”, a{”] --- zeta], --- 
z:”, 21], which determines at least n — 4 roots. In consequence of ¢(z{") = 
o(2i%}) = 1 one of the roots of this derivative lies evidently in [x?} , x4”. 
We now show that ¢’(zi”) < 0 and ¢’(z4%}) = 0. It will be sufficient to show 
the first. Suppose ¢’(z,) > ,0. Then ¢’(x) must have at least one root in 
[x<”, 24°]. But then ¢’(x) could not have more roots and thus ¢’(zx41) < 0; 
hence ¢’(z) ought to have one more root in [xg?2 , ze71], and this is impossible. 
¢'(2{t}) = 0 is to be obtained analogously. But then, if in (x{7}, xt”) there 
were a point with ¢(&) = 1 then ¢’(z) would have 3 roots in [xeti , zi]; an 
evident impossibility, which establishes the lemma. Fork = 1,2,n — 2,n —1 


the proof runs analogously. , 
Proor oF THEOREM III. Here also we start from > b(t) . By (38) we 


have—aj” has the meaning given above— 


n 1 1, (a)? 


p=1 
(39) b — a)?" n 2 


1 
But since kf” < [ p(t) dt (we made use of this at the upper estimate, too) 
then if e.g. 233} < 2 < 2§”, we may write 


m (b — a)*"" 
2n —2 


(40) > (laa)? + 2 — 


= 
| 

i 
A 

‘ther 

-1 


=x 
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Now by lemma IV for any z in the interval in question 
+ layi(a)” = $lla(z) + = 
i.e. by 


2 
(2n — 1) ~ 
we obtain that 


| wn(a) | > Af, p(t) al — |, 


which proves the theorem. 
THEorREM IV. Let us add to the hypotheses of theorem III that, throughout a 


subinterval [c, d] of [a, b], m S p(x) S Then, if denotes again 
the root of wn(x) nearest to x, we have in [c + ¢,d — ¢| forn > me, c, d, p) 


‘For the proof we require 
Lemma V. In[—1, +1] suppose p(x) = 0 and L-integrable; suppose further- 


more, throughout a subinterval [u, v] p(x) S v7 ie zi ; then for the Christoffel- 
numbers ki" belonging to the «\””’s lying in [u + 9, v — n] (n > 0), we have 


1 
< cel 4 [ v(t) au]. 


Proor. According to the first corollary due to Shohat of Lemma II we 
may write 


= min Pp@) dt 


f (26) )=1, 


= min 2 
F (9) cos (n—1yd [ | | p(cos d) sin 
(n)\2 
< [ 
0 + p(cos #) sin dé 
nsin 


fants 


| 
4 
ii 
t 
} 
La 
| 
Hed 
| 
f { 
| 
Ag 
| 
i 
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| 
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where u = cos B, v = cos a and C stands for the intervals complementary to 
[a, in [0, =]. Evidently 
sin n 

— 3,” n 


n* 


and 
1 
C39 
Ih < aint [ p(t) dt, 


which proves the lemma. 
Proor or THEOREM IV. From (39) 
(b — a)" "(x — 24”)? ie + 


(2n — 1) a) ka kay 


wn(x)” 2 


Ife +e S25 d—eandn > nc, d, p) then, according to footnote,’ 
the interval , containing x lies in [c + d — and hence, by 
applying lemma IV, and with the substitution u = c,v = d, 7 = }«, Lemma V 


we obtain for n > max (nm, , 4/€) = m 


2n—1 
w(x)? > cw — xf” 
M+[ 


which establishes the theorem. 


Remark I. In the special case m S p(x) S ee) throughout [—1, +1] 
we have by the aforesaid in [—1 + «, 1 — ¢] 


ca(p, €) |2 — | S | ca(p, 


Remark II. In lemma V we required the upper estimate of the Christoffel- 
numbers. Although we shall not use it, we mention, that if in [—1, +1] 
p(z) 2 m > 0, then 
2m 
= (1 — + 


equality only for p(x) = m; here P,(x) means the nt* Legendre-polynomial with 
normalization P,(1) = 1. By this 


25, 


rther- 
| 
4 
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Further it is easy to obtain that if in [—1, +1] p(z) = m > O, then 


SE, 
and, if in [—1, +1] p(x) 2 res. 2) holds, then 
n 


Remark III. If in Lemma V [u, v] = [—1, +1], then we have the sharper 


result 
Cas M 


Remark IV. We obtain from the proof of Lemma III 
2 1 @n(x) 


)= = 1 n 1 


where I,(x) are the fundamental functions of the Lagrange interpolation formed 
upon the roots of w,(z). As k, > 0 we evidently have for sufficiently small ¢ 


(see (2)) 
9, 
i.e. we obtained the well known fact that there is a root of w,-:(x) between 
each pair of roots of w,(2). 


SE — = —sg 


2. 
THroreM V. If for the fundamental functions belonging to the matrix IN 
(41) +1,k =1,2,---n,n> m0, 


holds for any sufficiently small ¢, then, at any fixed point z of the complex plane 
cut up along [—1, +1], we have 


2 
Here we must take those values of the roots which are positive on the positive real 
axis forz > 1. 

For the proof we require 

Lemma VI. From (41) it follows that, for any small positive , if n > ns(1); 


| 
i 
| 
| 
ail 
4 


rmed 


all 


ween 


e real 


ns(1); 
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Proor. Suppose that the lemma is not true. In this case there is a posi- 
tive absolute constant 6 and a sequence of positive integers ny < nz < --- such, 
that to any m we could give an integer », with 1 S »%. < m and 


| <4 6)”. 


But, as, according to a classical theorem of Tchebycheff, there is in [—1, +1] 


Wn, (x) 


a ¢ for which the value of the polynomial aga is not less than gre ?, at 
Ty, 


ghe same z = & 


ny (Ee) | > gue (3 — 6)" 5)"* 


which, for k + © contradicts (41) and thus proves the lemma. 
Remark. It follows from lemma VI and lemma I that (41) implies 


We shall make no use of this statement in this paper. 
Proor or THEOREM V. Let 
2v—1 

Ny an +2 

the roots of the (n + 1) Tchebycheff-polynomial 7',4:(z) (7, (cos 8) = cos nv), 


and represent w,(x) by the Lagrange-interpolatoric-polynomial taken at these 
n’s. If L,(z) (v = 1,2.--- (n + 1)) denote these fundamental functions, then 


ntl 
(42) wr(z) = L,(z) 


y= 1,2,--- (n+ 1), 


(zany number). But by (41) and lemma I, in —1 S$ z S +1, forn > m(e) 
we have 

wn (ay) (2 — 


|wn(x) |2""* <2 < 2(n + 1)(1 + ©)” 


ie. by | Thya(m) | = 1 from (42) for n > ns(e) 
8in +1) + 0" Vie — 1) 
| < 2” (n + 1)|z — 


ntl 1 


< 17/1 ve 


max 
|Z — 
hence, as z is not in [—1, +1], for n > ma(e, 2) 


na(€), 
plane 
j 
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Now let us consider the lower estimate and represent the (n — 1) Teheby- Q 


cheff-polynomial 7',:(z) by the Lagrange-interpolatoric-polynomial taken at t 
the roots of w,(z). We obtain 


As | 7'n-1(z») | S 1, there is an integer v with 1 < » S nand 
| > | | 
thus for n > ng(e) ( 
(43) | wa(z) | > (1 — + — 1) ||2 — |. 


From (43), by lemma VI, and as z is not at [—1, +1], we obtain for n > n¢(¢) 


2 


n—1 


+ Wie! — 1) 


1 
(44) | wa(z) | 2 on 


: | From (44) and n > n,(e, B) it is evident that in any closed bounded set B of 
| i the complex plane cut up along [—1, +1] we have uniformly 


where we are to take that value of the square root, for which the right side ~ z of 
forz— ©. This proves the theorem, since on the positive real axis, for z— ~, tk 
the two sides of (45) are equal without the sign of the absolute value too, and H 
upon the cut plane both sides are one-valued and regular functions of z. 
We already mentioned in the introduction that in the case of strongly normal 
polynomials, the asymptotic formula for [w,(z)]” upon the cut plane is a conse- 
quence of the above Fejérian-theorem. In order to state an analogous theorem 
for a class of orthogonal polynomials more general than that of Szegi’s we 
require a further lemma. 
Lemma VII. If the L-integrable weight-function p(x) is non-negative in w 
[—1, +1] and if its roots form an aggregate of measure 0, then for the fundamental C 
functions connected with the matrix p and n > ng(€) we have 
i Proor. We employ following theorem of E. Remes.” Let in [—1, +1] 
2 be given a finite set of disjoint intervals of total length 8. If throughout this 
7% 20 E. Remes: Sur wne propriété extrémale des polynomes de Tchebichef. Communications 
E! de I’ Institut de Sciences etc., Kharkow, 1936, série 4, XIII, fasc. 1, pp. 93-95. un 
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aggregate the absolute value of the polynomial f(x) of degree n is not greater 
than M then in [—1, +1] 
4 
where 7’, (cos 8) = cos nd. 
Now suppose lemma VII to be untrue. Then we have an infinite sequence 


of positive integers m <M <--- and a cy such, that to every n, there is a 
positive integer » with 1 S mand aé,, with —1 < &, < +1 to satisfy 


(46) | (Eu) | > (1 + Cas) "*. 


Apply now for 1,,,n,(2) Remes’s theorem taking for the aggregate of intervals 
of length # those intervals throughout which 


| (1 + 


from which 0 < & < 2 — c47 where cy depends wy upon Cy and is independent 
of m. Hence throughout intervals % of total length greater than cy, 
| T,ny(t) | 2 (1 + 4cy)"*. But then by the assumption made for the roots 
of p(x) we may omit from % intervals of smaller total length than }ce47 such 
that throughout the remaining a P(2) = cag , Where cyg depends only upon cq . 
Hence we had 


Thus we obtain 


[ p(x) dx = [ p(x) dx 
=> Cag dz > + 


which contradicts the Shohat minimum property of the 1,(x)’s (Lemma II. 
Coroll. I.). Hence lemma VII is established. 

By theorem V and lemma VII we state 

Tueorem VI. Let the weight-function p(x) be non-negative and L-integrable 
in [-1, +1] and assume the aggregate of its roots to be of measure 0. Then, 
taking the suitable values of the roots we have upon the plane cut up along [—1, +1] 


wuiformly in each interior domain. 
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3. 

In this section we consider c) problems. We prove following Fejérian 
theorem: 

TueorEM VII. Assume the matrix M to be such that [—1, +1] possesses a 
subinterval [b, a] = [cos B, cos a] with 

| (x) | S ca, k=vv+1,---p 
and for the other fundamental functions 
| | < coon" 
throughout [b, a], if 

Then 


[e(b a)P 1 < ofr) of” < Cag + a, b, Cw, 


if 4” and dif} are in [a + €, B — €, € denoting any small positive number. 
Proor. First we prove the lower estimate 


1 _ | &(cos of”) — 
| — of” oy” — 


| 


where 3” < 3* < dft}. But then, i (cos #) being a trigonometric polynomial 
of order (n — 1), by the Bernstein-Fejér-theorem we have 


1 “ — 1) 
| 3” | = — a)}*” 


which proves the lower estimate. 
Let us now consider the upper estimate. Suppose 


max — = = of, ate so” 


and we have to prove that A(n) remains smaller than a number independent 
of n. Wecan suppose A(n) > 10. Let r be the rit positive even integer 


of” (n) 
greater than [c;:] + 4, the largest integer with sn = bo 
and 

(47) (3) = + [sin 
= —-|| + 
sin bo sin ¢ 3 


} 
i! 
{ 
i 
| 
j 
| 
i | 
; 
| 
| 
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| 
=. 
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Then v(#) is a non-negative, pure cosine polynomial of order (n — 1) at the 
utmost, for which 


(48a) 2 1, 


and if, without any loss of generality, we assume 0 S 4) S ; 


1 1 1 2 
2 


9 sin 


Let us now represent g(#) by the nt Lagrange-interpolatory polynomial 
taken upon 9. Then by (48a) and (48b) we have 


1S 3”) L,(cos i) | < 


aot 


(49) 


Let us divide the sum upon the right-hand side into two parts with »y < k 
ory 2 k + 1 respectively. As, according to the already proved lower estimate 
in the first sum we have 


gin) — A(n) , (k— — a)} (n) € 
— > + for #3, zat; 
and in the second one 
&'S for 3," 9? 
we obtain that the sum on the right hand side of (49) 
1 
< 2n" 
[e(b — 
(4m 
< 1 < a, b)cagn” 
2 
asr 2 4 and 4 >5> ea (cy 2 1). By substituting this into (49) 
Wwe obtain 
1 < (C1 » & + Cia Cas(€, a, b, Cs) 


n2 A? ’ 


which establishes the upper estimate. 
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The consequences of this theorem for sequences. of strongly normal poly- 
nomials, we already mentioned in the introduction. For orthogonal poly- 
nomials we have 

TueoreM VIII. If the weight-function p(x) is non-negative and L-integrable 
throughout [—1, +1], and if, throughout the subinterval [b, a] = [cos 8, cos a] 


b P, < 3” < D, €) 


fate sh” < 
Proor. By (34a) we have forb Sx Sa 
2 2 


1 
Ask, < [ piijd,forb $2 $a, 


1 + 
Further a fortiori from (34b) 
€ 
k, ~ m(a— b) 


i.e. by applying lemma V with u = b, v = a, » = 4e for those 1,(x), for which 
b + de S 2\” < a — de we obtain 


€ € 
5S 5. 
b+-S28a 5 


Thus the peentiens of theorem VII are satisfied for [b + 3, a — }¢] with cy = 

| —b) [ p(t) aul , = 1; hence theorem VIII is proved. 

Remark I. From — III of lemma II _ from remark III of theorem 


IV it follows that if oe pee S p(x) S in [—1, +1], then 


Vv 2”) 
> Liz)" 
y=] m 


Remark II. By theorems IV and VIII we obiain that if in [—1, +1] the 
non-negative p(x) is L-integrable and in a subinterval [b, a] 0 < m < p(z) = 


} 
iit 
{ 
j 
> . 
} 
| | 
| 
1 
t 
( 
4 
j / 
¥ 
i 
| 
i 4 


oly- 
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M then | n(x) | takes between any two roots lying in [b + «a — ¢ 


We already mentioned in the introduction that if in the subinterval the 
weight-function is supposed. to be continuous there are asymptotic theorems 
to be obtained. For sake of simplicity let us ascribe to p(x) besides the prop- 
erties of being non-negative and L-integrable in [—1, +1] also continuity and 
positiveness of p(x) (1 — zx’) throughout [—1, +1]. The results are new for 
this case, too, but the argument is very much clearer. 

In (34c) for = & = cos g we had 


1 2 

&) 1 4 1 sin (2n — 1)¢ 


SIN go 


Consider the polynomial 
T'n—1(Eo) — Tn(Go) | 
— — 14 Lain Gn — 


(52) 


with T(z) = 1/+/2, T, (cos #) = cos wd,» = 1. It is evident that for n > 1 
¢,-1(f) = 1 and that by the formula of Christoffel-Darboux (which holds for 
n> 1) 


we may write forn > 1 


¥ 7.6) 7.(2) 
2 sin 
T (Eo) SIN go 


which means that ¢,1(z) is that polynomial which, amongst the f(z) 
polynomials of degree (n — 1) with f(f) = 1 minimizes the integral 


dt 
By (54) we have for any -1 S$ & S$ 1,-1 S$ +1 


(55) | | S Ca. 
For ¢n-1(z) we require following two lemmas. 
Lemma VIII. If forn © ng +0 and n(x — go) then the dis- 
lance between and the root of (cos 8) = nearest to = go is ~x/n. 
PRoor. According to (52) it will suffice to consider the roots differing from 
= of the equation 


(56) cos (n — cos — cos ngo-cos (n — = 0. 


| 
By 
rem 
the 
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As each of the (n — 1) intervals given by the n different real roots of 7',(x) = 
contains just one root of the equation 7,1(z) = 0 it is clear that any equation 
of the form AT,(z) + uT'n-1(z) hence also ¢n-1(x) = 0, have a root in the interval 


1 COS 2-1 Thusequation (56) has a root 


2n 2n 
in each interval of length 2n/n and consequently for the rightwards root ¢, 
next to g we surely have | go — go| S 24/n. Then by simple transformation 
we obtain from (56) 


sin (n — 9) = 0 


* sin (n — 3)(8 + go) — sin 


2 


which for ng > ©, n(x — go) — © immediately leads to A > x if gy = 
go + A/n. 

Lemma IX. If for the weight-function p(x) in [—1, +1], g(x) = 
p(x)/(1 — 2”) = m > 0 and q(z) is continuous throughout the same interval, 
then for 0 S go S 7, = COS 


1 
1 sin (2n — 1)q@ 
Sin 


1 
tim at = — lim 


1 +5 


1.€. if noo > ©, — go) 
1 


Proor. For |x — &| = 1/n' it is easy to obtain from (52) 


| | s 20n? 
i.e. 
(57) at < p(t) dt. 


+1 
|t—Eo| 


If | p(x’) V(1 — x”) — (1 — &)| S 6 for any |x’ — &| < we have 


-1 


(58) 


400 


( 


ith | 
| 
| 
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| 
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“i From (57) and (58) evidently 


wp (Eo) V (1 — 
‘oot sin 
i- 
400 1 
< [ p(t)dt + max — + Q.e.d. 
n 1 
Remark. If p(x)/(1 — 2’) = q(x) has at x = & a discontinuity of first 
order, then g(%) at the right-hand side in Lemma IX is replaced by 
= q(t + 0) + g(g — 0) 
5 
= Lemma X. If for the polynomial f(x) of degree k f(t.) = 1, & = cos g and 
val, any positive «and 6 < 1, k = 16/6 and sin g > 8/ké the inequality 
S(t)’ 7 
sin 
holds, then 
142)" 
1 f(t)? 
1 — 400k 
Remark. This lemma means that if the quadratic integral of the poly- 
nomial normalized for 1 at 2 = £ is “too small” in the interval [t — ¢, + él], 
it must be very large in some other parts of [—1, +1]. 
Proor. Without any loss of generality we may suppose & = 0. Then 
construct with the above f(z) 
(5) F 
@) = 
a are whose degree is less than k(1 + 36). In [—1, +1] we evidently 
have <1. From (34c) 
F 
ave (60) = ; 
sin | 2k +4]—|+1) ¢ 
4 2°3 sin go 


on the other hand 


in 
L 
‘ 
dil 


| 
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as in [—1, +1] the second factor of F(x) is non-negative but S 1. Making 
use of the hypothesis after the arrangement we obtain from (60) and (61) 


ud | 1 


(1 
i-? > 4 


100K? 
2 2 SIN go 


Q.e.d. 


According to what has been said before we may deduce asymptotic formulas 
for the Christoffel numbers belonging to p(z). 
THEorEMIX. Let p(x)+/(1 — 2”) be continuous and p(x)+/(1 — 2”) > m>0 


in [-1, +1]; then, if n we have for any lying in < 


n 


= 1,(t)* p(t) dt ~ 


Proor. First we show that we have for any ¢ and 6 independent of n if 
n > €) and |2i” | < E 


n? 
1,(t)” 
t 


Suppose the contrary, then by lemma X we had 
1 2 + 
[ 1,(t) di > 4 
1Jf(1 — 500n 


ie. a fortiori 


[ dt =m 


| 
1 
1 V(1— (1 — &) 
1 ( 
| 
| 
FT 


ing 


if 
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which contradicts the minimum-property of Shohat if n is sufficiently large. 
Thus (62) is proved. But then by (62) for n > ng(6, e) 


1,(t)° 
(63) (n)2 
> (1 8) )V(1 — ) 


1 sin (2n — 1)8;"” 
2 sin 3” 


5+ 


On the other hand by the minimum-property 


— 
1, 1 sin (2n — 


2° 2 sino 


1 1 
(64) p(t) dt < $na(t)* p(t) dt ~ 


by lemma IX, if we replace & by 2”); (63) and (64) lead to theorem IX. 
And now we may go over to the asymptotic representation of the funda- 


mental functions. 
TuzorEM X. In [—1, +1] let p(x)\/(1 — 2’) be continuous and such that 


p(z)V(1 — 2) > m> 0. Then, if > 0, n > < - 
for -1 S x S +1 we have uniformly (T, (cos 3) = cos rd, r > 1) 
T n(x) — 


1, 1sin (Qn — 
(n-3+5 sin 0” 


— | = — 


Proor. Let us consider, with the above ¢n-:(x), the integral 
1 
(65) 1, = [ — ae. 


AS = and = 1, we obtain 


n 1 
(66) 
1 1 
+ dt = + at 


(n) log 
ut then by theorem IX and lemma IX, forn —> |2,""| S| 1— 

uniformly in » 

(67) lim nI, = 0. 


d. 
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From the premise and the remark to theorem VIII it follows for [—1, +1] and 
y=1,2,---n 
M 
| 


By this and (55) for 1, (cos 8) — dn-1 (cos 8) = (8) we obtain 

| ¥(9) | cap). 
But then by the theorem of Bernstein-Fejér, in [0, 7] we have 
(68) | ¥'(8) | S 
Let (0) assume its absolute maximum in [0, 7] at & = yo and let this maxi- 
mum value be D,. Then in the subinterval 7 = | yo — iets »Yo + | of 
[0, x], by (68) we have 


D, _ D, 


i.e. 
(L(t) — at = [ (cos 8) sin do 


D, D; 
Con 4° 


> m ¥(8)? di > m 
(4) 


Thus by (67), for — © a fortiori 
mD} 


i.e. D, — 0, which establishes the theorem. 

From theorem IX and lemma VIII we easily deduce 

TueorEM XI. Let p(x)V/(1 — 2°) be continuous in [—1, +1] with 
p(x)V/(1 — 2”) = m > O, further let C(n), for n > @~ arbitrarily slowly, tend to 
infinity; then for those roots cos 3” of the n* polynomial orthogonal to p(z), 
which satisfy 


— 0 


< < oi sx - 


we have uniformly in k 


lim — df”) = 
Remark I. If we assume not p(z)./(1 — 2’), but p(x) to be = m and con- 
tinuous and not in [—1, +1] but in the subinterval [a, b], then ¢,-1(z) is to be 
replaced by a polynomial of similar form but the Tchebycheff polynomials 


id 
i 
= 
i 
|. | 
bau 
| 
id 
| 
t 


and 


axi- 
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T,(z) are to be replaced by P,(x) Legendre-polynomials. Theorem IX and X 
remain true for those k, and 1,(z), for which a + ¢ S$ a” S<b—eanda+esxs 


sb-« 
’ REMARK Il. If we attribute again continuity and positiveness in [—1, +1] 
to p(z)V (1 — x’), it is probable, that theorem X holds for the fundamental 
functions belonging to every eae Theorem XI does not hold for every zi” in 
the original form; the difference 3441 — #;” will be asymptotically equal to the 
distance between 3” and that root of 


cos (n — 1)d{” cos nd — cos nd{" cos (n — 1)8 = 0, 


which is nearest, on the right, to 3”. 


4. 


In this section we consider the number of roots of polynomials in a given 
interval. We already mentioned in the introduction that if 


(69) lim [| L(x) <1 


uniformly for —1 S$ x S +1 and » = 1, 2,---,n, then the fundamental 
points of Mt are uniformly distributed. We present an elementary proof for 
this Fejérian-theorem. Here and also later a theorem of M. Riesz” plays a 
most important part, so—because of its shortness—we reproduce it as 

Lemma XI. If a trigonometric polynomial of order n, f(d), takes its absolute 
maximum in [0, 2x] at & = dp , then there is no root of f(3) in} do — > do + 

Suppose the theorem to be untrue. Without loss of generality let % = 0, 
f(0) = 1 and suppose, that the nearest root, the distance of which is less than 
1/2n, lies to the right. But in this case the curves y = f(#) and y = cos nd 
would have at # = 0 at least a double point of intersection and by the premise 
a third one in [0, x/n]. In any of the intervals [x/n, 2x/n], [2x/n, 3x/n], --- , 
[(2n — 2)x/n, (2n — 1)x/n] they would also have at least one intersection; 
hence the trigonometric polynomial f(8) — cos nd of order n had in [0, 27] 
(2n + 1) roots, which is impossible. 

Corottary. If a trigonometric polynomial of order n takes its absolute mazxi- 
mum between two real roots, the distance of these roots cannot be less than m/n. 
The statement holds for all n. 

TaroreM XII. If upon the matrix M 


"site k=1,2,---n, -1S2 41, 
forn > no(e), then we have foranyO Sa<BSr 
no 
asd 


1M. Riesz: Eine trigonometrische Interpolationsjormel usw., Jahresbericht der deutschen 
Mathematischerver, 1915, pp. 354-368. 


__| 
D} | 
be = 
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Proor. If the theorem would be untrue, we had in [0, xz] a subinterval 
[a, B] and a ces such, that there would be an infinity of integers n, < m <... 
for which the number of the n,** fundamental points lying in [a, 8] is less than 


We may assume ca <° =“ and write instead of ngsimply n, 
Let 

(70a) 


where k runs over the integers (positive, negative and 0), for which ¢, lies in 
[o, a+ =| ; for n — © the number of these yg, asymptotically equals 


+ n. Let further 


n+ 1’ 


where / runs over the integers (positive, negative and 0) for which y, lies in 


(70b) 


E _ +, r| for n — o the number of these asymptotically equals 


—Bt+ Let further 
4 
(71) G(x) = IT’ (x — cos 3") I (x — cos yx) et (x — cos 1), 


where ||’ is to be extended over the 3” lying in [a, B]. The degree of G(z) is 
by the premise and the definition of g, and y; , for sufficiently large n, 


1 1 Cos 1 _ (, _ 
(72) <i @-a-cont (at +%)n=(1 


As the order of the trigonometric polynomial G (cos #8) is less than n and the 
distance of its consecutive roots in fo a+ =| and E - , | is less than 
a/n, G (cos #) takes, by lemma XI, its absolute maximum in [« + 
at a place 8 = y say. Let finally 


where the brackets in the exponent denote the greatest integer contained. 
Then, by (72) the degree of F(z) is 


(74) | 


an 
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and, like G(x), F(x) takes its absolute maximum at 3 = y, too. Then 
F(x) = 
i.e. for = COs 


where, by definition of F(x), >)” refers only to those #;"’, which are not in 
(a, 6]. But then, by the hypothesis, for n > mio(e) 
|@(cos y) | < (1 +6)” 20” | Flas”) | 


< D1 Ges”) 


cggn/8x 


As by definition of y 
| G(cos | 2 | q 1, 2, 


we have a fortiort 


4 


4 


( (os (6 7) 


4 


which is, with sufficiently small ¢, untrue for n > nu(e) and thus the theorem 
is proved. 

By theorem XII and lemma VII we obtain 

TuHeorEM XIII. Let p(x) be in[—1, +1] non-negative, and L-integrable further 
suppose that its roots form an aggregate of measure 0; then for the roots of the n 
polynomial cos 3" belonging to p(x) we have 


| 
asd (n) <8 


where [a, 8] denotes any fixed subinterval of [0, 7]. 

If we want to secure uniform distribution with error-term, we require the 
following Fejérian 

Tueorem XIV. If for a matrix 


SD, 41, =1,2,---n, n=1,2,---, 


3 


i 
i 
n 
Is 
| 
n | | 
| 
PY. 
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then for the elements of the n** row, cos ee oe ae , n) and for any sub- 
interval [a, B] of [0, +] satisfying (8 — a)n 2 ceo(D, €) we have 


aso(n)<p 


where we emphasize that cw(D, €) is independent of a and 8, too. 
Proor. Consider first the upper estimate. Let [a, 6] be a fixed subinterval— 


without loss of generality we may suppose 8 — a S 3 +: wah. - n| = k and 
assume 
(75) 1=k41 


v 
asd 


From a rightwards let us cut off k-times the distance 2/n and leftwards [}I]-times 
until we reach A, further from 6 rightwards also [}/]-times until we reach B; 
if a — r/n [31] < 0 or B + x/n [31] = x, set A = 0, correspondingly B = r. 
Let the points of division be g,. Let further 


(76) = I] (« — cos y,) IT’ (x — cos 


where ||’ in the second product runs over the 3”) lying outside of [a, 8]. In 
this case G,(cos #) is a pure cosine-polynomial, whose order S$ n — k — 1+ 
k + 2[41] <n — 31. Then the distance of two consecutive roots of G:(cos #) in 
[A, B] is not greater than 2/n, i.e., by lemma XI G;(cos #) takes its absolute 
maximum outside of this interval, at a point ? = A, say. Let 


(77) Fi(z) = Gila) Tun (-1 ) 


cos a — cos B/’ 
where 7'yn(cos ¢g) = cos [3l]g. As the degree of F(x) is not greater, than 


n — 4l, we represent F(x) by the Lagrange-interpolatory polynomial formed 
upon the n* row of the matrix IN and obtain 


i.e. 
(78) Fi(cos | = > d) | = | 20” d) |, 


where ),” runs over integers, for which 3{” lies in [a, 8]. As forany1 <»<" 


| Gi(cos d) | 2 | Gi(as”) | ’ 


| 
(7 
| 
ik 
H If 
| | 
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by (78) and the hypothesis we have 
cos A — cos B | 


<D ‘Tun(- 1+2% 
aso 


(79) 


cos — cos 


cos a — cos B 
Each term of the right-hand-side-sum is not greater than 1 and the number of 
terms is less® than en(D)k, i.e. by (79) 


| cos A — cos B 
Tun S en(D)-k. 


(80) 


If A or B fall upon one of the borders of 10, x] \ cannot be there, thus from ) it 
may be assumed 


Without loss of generality we may suppose \ = 6 + [jl] x/n. Then we have 


>142 


cos a — cos B 


cos A — cos B 


Suppose 0 < a; < az < ag S a3 — = — = ; then we obviously 
have 


61 
COS — COS a3 _ (« + 


COs a1 — COS as sin sin (as 
2 


(82) 
2 
sin 2 
sin sin (3 + bab + + 62)" 


Casel. 12k = 20. We apply (82) with o: = a, a2 = B, as = B + [fllx/n; 


then we have 
and 


(83) > on. 


cos a — cos B 
As T,(x) increases monotonely for z = 1 and satisfies for p = 0 the inequality 
(84) + p) 2 4 (1 + V(20))’, 


2 
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we have from (80), (81), (83) and (84) 
ch, k 


l < en(D) log k 
which together with k < 1 gives k S ko(D). 
Case Il. 1 <k,k 2 ky(D). In this case 6; S < < an d 


cos a — cos B be k 
From (90), (81) and (84) we have 


1 
Tan (1 ({)) <en(D)k 


< co(D) (k log 


Let us now consider the lower estimate; when again k - : n| = k, let 
(85) 


Thus we have to estimate | _ above. Now cut off from a to 0 leftward dis- 
tances of the length (1 + ink =) as many times as possible and rightward 


[4/]-times until A’; furthermore, from 6 rightward to x as many times, as possible 
and leftwards [4/]-times until B’. As 


Lian lx 


we have A’ < B’. We denote the points of division by ¢,. Let 
(86) Gx(z) = IT (@ — cos g,)- I] — cos 
« 


where the second product refers to the #{””s lying inside of [e, 8]. In this case 
G2(cos #) is a pure cosine-polynomial of order 


(x — B) 


<k-5+ (x — (6 —'a)) 
r( 
10nk** kar 
n l n 
"+ T+ 10h + 2 102 


Wi 


10; 
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Suppose that 1 > 3k*** is true; then the order of G2(cos 3) would be 


n 
(87) 


and after lemma XI the place 8 = A, on which G:(cos #) takes its absolute 
maximum could be only in a root interval, whose length is 


+ 
degree of G2 (cos 7 10k*-« 
But on the outside of [A’, B’] the distance of the consecutive roots of G2(cos #) 
ws Fi + ae so \ could be only in [A’, B’]. Without loss of generality 


letOSAS 5: then we have for 0 Svar 


C81 


X 
n 


n 
With this cg; let M be the greatest odd number not exceeding 400 cg; ioe + 1 


k** 


N N 
i M sin M 
(88a) (cos #) =a I+) + 
2 2 
and finally 
(88b) = 
200can k** n 
Th 
e degree of the polynomial ¥(x) is —— ly < 400ce, < 


7 and according to (87) and (886) the degree of F2(x) is S n — 1, i.e. 


(cos d) 
< D>’ | |, 


[Fs (cos ») | = | (cos 2) | 
(89) v=1 


the last two summations refer to the 3{””s lying outside of [a, 8]. As | G2 (cos d) | 
2 | Ge(2;”) | (v = 1, 2, --- , n), we should have from (89) 


(90) |v (cos 4) | < (cos 8{”) |. 
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It is easy to see that 


¥ (cos d) = 1, lv (cos #)| < 2 Gags)» 


and comparing this with iat and with our theorem, we have 


1 < 2D 


(91) 


1 1 1 


1 1 1 
Bur for 1 > ¢s3 we have 


and (91) gives k > cg 


further for 1 > cg; we have 


40cgin N > k 


which gives from (92) 
N k2£/800cg1 
1< ) < 
Cee 


This means a contradiction for 1 > cg(D, €). Q.c.d. 
THEOREM XV. If upon the matrix M 


then for any subinterval [a, 8] of 


- < ceo(cer , Ces, €)n*** 


asd <p 


Proor. It will be sufficient to prove that for any subinterval the upper 
estimate holds, as in this case the respective application for the intervals [0, a] 
and [8, z] leads to 


a 
> 1-- n| < 


i.e. 
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and respectively to 
> 1- n| < 
sx 


i.e. 


asd (n)<p 0<0(")<a ("<x 


which establishes the lower estimate. The proof of the upper estimate is com- 
pletely analogous to that applied in theorem XIV. 

For a sequence of strongly normal polynomiais, theorem XIV immediately 
presents the uniform distribution of roots in [0, x] with the error-terms men- 
tioned, but we do not state this in a separate theorem. For orthogonal poly- 
nomials according to (50) and to the first remark appended to theorem VIII 
we may state that if throughout [—1, +1] we have p(x) = m > 0 and L-in- 
tegrable, then 


1 4 
y=1,2,---n, n=1,2,---, -lSzs +l, 


or respectively, if in [—1, +1] is p(x) L-integrable and m S p(x) (1 — 2”) $ M 
then 


[ow], -ls2rs+tl, v=1,2,---n, n=1,2,---. 


Hence theorem XV and XIV are applicable and we obtain following two the- 
orems: 

TororEM XVI. If the weight function is L-integrable and satisfies in [—1, +1] 
p(t) = m > 0 and the roots of the n* orthogonal polynomial are cos 35", then for 
any [a, 8] of [0, x] we have 


aso 


TuroreM XVII. If the weight function p(x) is L-integrable and satisfies in 


[-1, +1]0 <m S p(x) V(1 — 2’) S M, then for the roots of the n™ orthogonal 
polynomial cos 35" and for any subinterval [a, 8] of [0, x] we have 


< en(p, €){(8 — a)n}*** 
aso 
if n(8 — a) > coa(p, 


INsTITUTE FoR ADVANCED Srupy, Princeton, N. J.; 
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A REMARK ON A PRECEDING PAPER BY J. VON NEUMANN 
By IsragL HALPERIN 
(Received August 11, 1939) 


In a preceding paper ‘On rings of operators, III” J. von Neumann has de- 
veloped a theory of normed operators for operators in an arbitrary factor 9n. 
In part the procedure is as follows: First [[A]], the norm of A, is defined for all 
A eM of finite rank (Definition 1.3.1) and fundamental sequences of such 
operators and an equivalence relation for fundamental sequences are defined 
(Definitions 1.4.1, 1.4.2 and 1.4.3). Then the key theorem (Theorem III) is 
proved that every fundamental sequence is strongly convergent operatorially, 
that two fundamental sequences converge strongly to the same limit if and only 
if they are equivalent, and finally that the norms of a fundamental sequence 
converge numerically to a limit depending only on the operator limit of the 
sequence. 

The proof of Theorem III makes use of two important lemmas: 

Lemma 1.4.1. Jf A,,n = 1, 2, --- ts a fundamental sequence with [[A,]] — 0 
asn— ©, then A, converges strongly to 0 asn 

Lemna 1.4.3. If A,,n = 1, 2,--- ts a fundamental sequence with A,, con- 
verging strongly to 0 as n then [[A,]]| ~ 0 asn ~, 

The purpose of the present note is to point out how the proofs of these two 
lemmas can be simplified by using more fully the results of the paper ‘On rings 
of operators, II’ by Murray and von Neumann.’ We require the following 
lemma. 

Lemma. If OW is a factor in a finite case and A, ¢e ON with A, converging 
strongly to 0 as n — then [[A,]] ~O0asn— 

Proor. There is a finite set of elements gi, --- ,gm such that [[A,]]’ = 
Tray (AnAn) , gi) = (Angi, Angi) = || Angi ||? 
for alln.* Since A,gi > 0 as n for all 7, and mis finite, the lemma follows. 

Proor or Lemma 1.4.3. Suppose the lemma false, if possible. Then for 
some ¢ > 0 we could assume that [[A,]] > ¢ for all n, by choice of a suitable 
subsequence. For some fixed no, [[A, — A,]] < ¢/2 for alln,r 2 m. From 
the definition of a fundamental sequence, A,, is contained in some closed linear 
manifold IM of finite dimensionality relative to ON. Let E be the projection 
on M. 


1These Annals, vol. 41 no. 1 (January 1940), pp. 94-161. 

* The writer is greatly indebted to Professor von Neumann for the opportunity to see 
the paper in manuscript and for the suggestion to publish this note. 

*In particular the use of Lemma 1.3.2 of ‘On rings of operators, III’ will be avoided. 

‘This uses Theorem III of ‘On rings of operators, II’. 
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Then EA,E, n = 1, 2,--- is a fundamental sequence converging strongly 
to 0 as n — ©, in the factor Mum) which is in a finite case. The preceding 
lemma now implies that [[ZA,E]] ~Oasn— ». But 


=>e— ¢/2 = 


for alln = m. This contradiction shows that Lemma 1.4.3 cannot be false. 
Proor oF Lemma 1.4.1. The factor 9M of operators in the Hilbert space 
is in one of the classes (I), (II) or (III). If 9M is in class (III) the only funda- 
mental sequence is that with A, = 0 and hence the lemma is trivially true. 
If Mi is in class (I) or (II) we can consider § as a sum of closed linear mani- 
folds © = >.° M,° such that Theorem II of ‘On rings of operators, II’ applies 
to every Mm, Thus for every r there is a fixed element g, ¢ Mt, such that 


Trey (A) = (Agr, gr) forall and [Bg, ; Be M’] = M,. 

Now let An, » = 1, 2,--- be a fundamental sequence with [[A,]] — 0 as 
Let E, be the projection on Then 

\| Angr |? = (AnBge, Ange) = (E-AnAnE Gr 
= Troy (E-AnAnE;) (An An) = 

It follows that A,g, ~ 0 as n — ©, for every r. Since A,(Bg,) = B(A,g,) 
for all B eM’, and such B are bounded, it follows that A,(Bg,) ~O0asn— ~. 
Since these Bg, and their finite sums are everywhere dense in §, and since the 


A, are uniformly bounded, it follows that A,f — 0 for all f «5, which proves 
the lemma. 


Harvarp UNIVERSITY. 


‘The number of summands may be finite or countably infinite. 
‘This uses Lemma 11.4.3 of ‘On rings of operators’. 
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ON REGULARLY CONVEX SETS IN THE SPACE CONJUGATE TO A 
BANACH SPACE 


By M. Krern V. 


(Received June 6, 1939) 


INTRODUCTION 


Here and throughout this paper we denote by E a Banach space (a linear, 
normed, and complete space) and by E* its conjugate space.’ 

A set K C E* will be called regularly convex if for every g ¢K (g ¢ E*) there 
exists an element 2 ¢ E such that 


Sup (x0) < g(a). 


The space E* itself we consider as regularly convex. 

Any closed sphere o(fo; p) (fo ¢ H*, p > 0), ie. the set of all elements f ¢ E* 
such that | f — fo| S p gives us one of the simplest examples of a regularly 
closed set. We obtain a still more simple example if we consider the set of all 
f « E* such that c, < f(y) S ce where y is an arbitrary element from E and 
S any real constants. 

Observe that if K is a linear subspace of E* then the expression regularly 
convex denotes the same as regularly closed (see Banach [2], p. 116). 

It is easily seen that only convex sets in E* may be regularly convex. Fur- 
thermore, every regularly convex set is always closed and moreover weakly 
closed in the sense of weak convergence of functionals. 

It is also clear that the intersection of regularly convex sets taken in arbi- 
trary number is empty or is also a regularly convex set. In view of this fact, 
for every set S C E* there exists the smallest regularly convex set containing 
the set S (namely, the intersection of all regularly convex sets containing 5). 
This set K will be called’ the regularly convex envelope of S. 

In the first chapter of our paper we state some general properties of regu- 
larly convex sets, in particular, in §1 we study the properties of bounded regu- 
larly convex sets. Theorem 1, which gives a representation of elements of the 
regularly convex envelope of a bounded set by the elements of the set itself, 
is the central theorem of this paragraph. 

On the basis of this theorem we show that the linear aggregate and the convex 
envelope of a finite number of regularly convex bounded sets are also regularly 
convex. Most of the theorems of §1 may also be obtained by developing some 


? The terminology used will be that of Banach’s well known book [2]. Numerals in 
brackets refer to the list of references at the end of the paper. 
2 In analogy with the definitions of linear envelope or convex envelope of a set. 
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transfinite methods proposed first by Banach (see [2] p. 116-122). Establishing 
a simple lemma we have succeeded in finding a quite elementary method of 
proving all these theorems. Formulated in a more particular form, this lemma 
has been used in the classical moment problem long ago (see M. Riesz [12]; 
Achyeser-Krein [1], Article 2; I. J. Schoenberg [14]; L. Rasa [5]). But 
it seems that in this paper the lemma is for the first time applied to the study 
of regularly convex sets. In §2 we extend a part of our results to regularly 
convex unbounded sets. The fundamental result of this paragraph is that a 
set is regularly convex if and only if its intersection with every bounded regu- 
larly convex set is regularly convex. 

The proof of this theorem is based on an idea of Banach’s as in §1 Banach’s 
transfinite methods are not used. Only after having established all the main 
properties of regularly convex sets do we prove in §3 that a set is regularly 
convex if and only if it is convex and transfinitely closed. This criterion is a 
generalization of that of Banach’s that gives the conditions under which a 
linear subspace of E* is regularly closed (convex). The idea of our proof 
differs somewhat from that of Banach. Furthermore, we show that it is useful 
to define transfinitely closedness in a somewhat different way than it was 
defined by Banach. In §4 it is shown that the criterion of regular convexity 
of a set may be considerably simplified if the space EF or the set itself are 
separable. 

In chapter II we show that the study of factor-spaces introduced by Banach 
((2], p. 232) and Hausdorff [4] is closely related to that of regularly convex sets. 
For instance, it is shown that a linear set belonging to E* is regularly closed 
(convex) if and only if it is the conjugate space to a factor-space E/G (G C E). 
By means of similar propositions we state that if G is a linear subspace of EZ 
then (@*)* is a linear subspace of (H*)* (see §1); if Z is regular ((Z*)* = E) 
then so are G and E/G, and conversely (see §2). If the unit sphere of £ is 
weakly compact, then the spaces G and E/G both have the same property, and 
conversely (see §3). 

In chapter III we study the regularly convex envelopes of weakly compact 
sets. As has been remarked by S. Mazur [9], the convex envelope of a compact 
set of Fis compact. Using considerably more complicated means, we state the 
analogous propositions for sets from E or E* which are weakly compact in any 
sense. The case in which the set belongs to E has been already discussed in 
another place (see M. Krein [7] and V. Smulian [17]). The results concerning 
this case are now obtained from a more general standpoint. The theorems on 
convex envelopes of weakly compact sets permit us to formulate the fixed point 
theorems of Schauder in a somewhat more general form than formulated by 
Schauder [12]. * 


CuHaPTerR I 


1, Bounded regularly convex sets. Let S be an arbitrary set. Denote by 
Qs the linear space of all real-valued bounded functions ¢(s) defined over S 
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with the usual definition of the norm 


Ile ll = Sup | ¢(s) |. 


An additive homogeneous (distributive) functional p defined over Qs shall be 
said to be nonnegative, if o(s) = 0 (s « S) implies p(v) = 0. Such a functional 
is always continuous, i.e. belongs to Q: (Qs is the conjugate space to Qs). In 
fact, for every ¢ e Qs we have 


\| ||-e(s) + g(s) 2 0 


where e(s) = 1 (s eS). Therefore, p being nonnegative we obtain || ¢ ||: p(e) + 
p(y) = 0, and consequently 


| | 
= Sup _—__ = ple) < @, 
A nonnegative functional p will be said to be normalized if || p|| = 1 


(p(e) = 1). The set of all nonnegative normalized functionals p ¢Q? will be 
denoted by $s. | 

Lemma.’ Let L be a linear subspace of Qs and let e(s) eL. If a distributive 
functional jo defined over L takes nonnegative values over nonnegative functions 
ge, then there exists a nonnegative functional » «€ Qs such that p(y) = poly) for 
all L. 

Proor. Let y:(s) «Qs — L. Denote by ¢’(s) € Qs (or g’’(s) € Qs) any fune- 
tion such that for all se S 


(1) o'(s) S vals) (or vals) S 9"%(s)). 


Such functions exist for e(s) and —|| ||-e(s) S S || ||-e(s). 
It follows from (1) that always p(y’) S p(y”) and therefore that there exists 
such a & such that 


(2) Sup p(y’) S & S Inf p(y”). 


Consider the linear space L; > L formed by all functions y of the form: 
¥(s) = o(s) + w&(s), where geL and ¢ is an arbitrary real number. Put 
pi(v) = p(y) + té. The functional p; defined over L , is obviously distributive. 
It is also easy to see that in virtue of (2), the functional p, takes nonnegative 
values over nonnegative functions y ¢ L; . 

Thus we see that it suffices to well order the set Qs — L in order to arrive 
by the above described extension of the functional po(y) to the functional 
p(y) looked for. 


* It may be easily shown that if an additive functional p has the property that ¢(s) 2 9 
implies p(y) 2 0, then it is necessarily homogeneous and consequently distributive, i.e. 
pie + wy) = Ap(y) + uply) Q,). 

* This lemma is a particular form of a more general one (see Krein [6]). As was indi- 
cated in the introduction, this lemma has its origin in the classical moment problem. 
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From now on S will denote a bounded set in E*. In this case to every xe E 
corresponds a function ¢:(f) ¢ Qs defined as follows ¢.( (f) = f(z) (f S). Ob- 
viously ¢arsty(f) = ag:(f) + by,(f). Now every pe Q: generates a functional 
g(t) by the formula g(x) = p(y«). Obviously g E* since g(x + y) = 
+ g(v) and | g(2)| || p||-Sup |fl@)| < We shall write for g, 


= p(S). 
TuzoreM 1. Let S be a bounded set in E*. Then the regularly convex en- 
velope K of S coincides with the set of all elements g « E* which may be obtained 


by the formula 


(3) g = p(S) 
where € $s 

Proor. Denote by K; the set of all g ¢« H* which may be obtained by for- 
mula (3). 


The set K; contains the set K. In fact, to every point fo « S corresponds the 
functional ps, Bs defined as follows = o(fo) (g €Qs), and for which 
we obviously have py,(S) = fo. 

The set K, is regularly closed. In fact, let g¢Ki. Denote by L the linear 
subspace consisting of all functions ¥(f) € Qs of the form ¥(f) = ¢.(f) + ce (f) 
(reE, <c< Put poly) = go(x) +c. The functional po defined over 
L in general may be many-valued. We state that there exists at least one 
function 


Yo(f) = flr) + = 0 (fe 8) 
such that 
(5) = golte) <0. 


In fact, admitting the contrary, ¥(f) = 0 (feS) implies p(y) = 0 and, 
in particular, if ¥(f) = 0 (fe S) then po(y) = 0, which shows that po is single- 
valued. As po is also additive and e(f) e L, there exists by the Lemma a non- 
negative functional p such that p(y) = poly) (We L). But then |p| = 
pe) = po(e) = 1, ie. pe Bs and furthermore go(z) = po(yz) = (x 
Le. go = p(S) e K,, which contradicts our supposition. 

Thus there exists an element 2 ¢ E and a constant c) such that (4) and (5) 
hold and consequently 


Sup f(—20) < 


which shows that K; is regularly closed. 

Thus K, being regularly closed and containing ‘Ss, contains also K. It re- 
mains to show that every element of K; belongs to K. Admitting the contrary, 
let goe Ki and ¢K. Then K being regularly closed there exists an element 2 
such that Sup f(%) < go(ao). On the other hand as go ¢ Ki, there exists a 
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Yo € Bs ont that g = po(S). Therefore 
go(xo) = po(f(xo)) S Sup f(m) Sup < go(x), 


which is impossible. The theorem is thus proved. 
Corouuary 1. For every xe 


Sup f(x) = Sup g(z). 
fes geK 
In fact, if g eK then g = p(S) (p € Bs) and therefore 
g(x) = p(gz) S Sup ¢(f) = Sup f(z). 
fes fes 


Coroutiary 2. If for any pair x, EF, x2 = for f eS, then 
= g(x2) for ge K. 

In fact, if g = p(S) (p Ps) then = p(yz,) = = for = 
= f(x2) = ¢2,(f). 

Remark. Let Rs be a linear subspace of Qs containing the function e(f) = 1 
and all functions ¢,(f) (x ¢£). Denote by 5 the set of all functionals p ¢ R: 
which are positive (i.e., take nonnegative values over nonnegative functions 
g € Rs) and have the unit norm (|| p || = 1). It is easy to see that theorem 2 
remains true if we replace in it the set Bs by the set Bs . 

Before passing to the next theorem we will consider an example applying 
theorem 2 in the form indicated in the Remark. 

Let S be a sequence {f;}f « H* converging weakly to an element fy « E* 
(lim fi(x) = fo(x) for every xe EF). In this case the space Qs is equivalent to 


the space (m), for every function g(f) € Qs is determined by its values & = ¢(fi) 
(i = 1,2, --- ) and || ¢|| = Sup|¢(f) | = Sup | & |. 
fes lsiso 
The set R of all functions ¢ € Qs such that 
lim o(fi) 


exists, form a linear subspace of Qs which evidently is equivalent to the space 
(c). Therefore to every functional p ¢ R* (see Banach [2], p. 65-67) corresponds 
a sequence {c;}9 such that 


= cael + lim o(f) (oe R), 


hence || p || = dX |c; | andc; = 0 (¢ = 1, 2, --- ) if the functional p is positive. 


Since the function ¢(f) and every function ¢.(f) = f(x) (x « E) belong to R 
we can apply to S theorem 2 (in extended form), and so we find that the regu- 
larly convex envelope K of the set S = {f;}? consists of all elements g ¢ E*, which 
admit the representation 


(*) g(x) = exfi(x) (xe E), 


no 


ev 


wh 
i} 
| (* 
en 
K 
In 
It 
th 
| Pa 
Ol 
tan 


=~ 


REGULARLY CONVEX SETS 561 


where 


(**) 20 @=1,2,---) and Da =1. 


As (*) and (**) imply that g = > cif; we conclude that the regularly convex 


envelope K of the set S = {f,}r coincides with its convex closed envelope. 

TuroreM 2. Let S;, Se, ---, Sn be a system of bounded sets in E* and let 
K,, Kz, -+- , Kn be the corresponding system of the regularly convex envelopes 
of 8: (i = 1, 2,---,n, K; D S;). Then the regularly convex envelope K of the 
logical sum S = Si U S2 U--- U S,° coincides with the convex envelope K' of 
the sum Ki U Ke U U Ka. 

Proor. Obviously K being convex and containing all K; contains also K'. 
It remains to show that, conversely, K C K’. In proving this we may suppose 
without loss of generality that all sets S; (¢ =-1, 2, --- , m) are disjoint, for in 
the contrary case we replace the sets S; by sets S; defined as follows: 


Si = Si, Si = Sp — SpSi — --- — (k = 2,3,---,n). 


Take any pe $s. Our theorem will be proved if we show that g = p(S) 
belongs to the convex envelope of the sum K’ = Ki U Ke U --- Kn. 
Put wi = p(ei) = 1, 2, --- , m), where 


1 for feS; 
0 for feS —S; (§ = 1,2, 


Obviously = = 1, 2,---,m) and + we = 1. 

To every function ¢; € Qs; corresponds a function ¢ determined as follows: 
¢g(f) for feS; 
0 for f € S a S;. 

Put ¥:(¢;) = p(g). Thus our functional p $s generates n functionals € 
(i = 1,2, --- ,), which, obviously, are nonnegative and have respectively the 


norms || ve || p(ex(f)) 1, 2, tr , 0). 
Define for each u; > 0 the functional p; « Bs by the formula 


(6) e(f) = 


“=| 


1 
p= — Yi. 
Mi 


Observe now that each function ¢ e Qs generates a function in each of Qs, 
(i= 1,2,-..,m). We shall denote this function by the same letter y. As for 
every 9 €Qs 


of) = 


1 


* Ie. the union of all elements belonging to at least one of the sets S,, «+: , Sn. 


é 
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we obtain 


In particular, as g(x) = p(f(a)) £), we have 
g(x) = wipi(f(x)) = (cE), 


where gi = pi(S;) K;. 
Consequently, recalling that > ui = 1, we conclude that g = . igi belongs 


ui>o 
to the convex envelope of the sum Ki U Ke U --- U Kz, which completes 
our proof. 

Coroutuary. If Ki, Ke,---,K, are bounded regularly convex sets in E*, 
then their convex envelope is also regularly convex. 

To obtain this corollary we put S; = K; (¢ = 1, 2, --- , m) in theorem 2 just 
proved. 

If a, a2, ---,@, are any real numbers we understand by + aS: + 
..» + a,S, the set of all f that admit the representation f = afi + --- + af, 
where f; (¢ = 1,2, , n). 

The method used by proving the last theorem permits also to prove the 

TuroreM 3. Let S,,---,S, and K,,---,K, have the same meaning as in 
the preceding theorem. Then the regularly convex envelope R of the set S = 
aS; + --- + an,S, coincides with the set R = + a2Ke + --- + 4,K,. 

Proor. In proving the theorem we may assume all a; > 0 for if some a; < 0 
we may replace such an a; by —a; and the corresponding K; by —K; without 
changing &, &’ and G; if, furthermore, some a; = 0, then we exclude from our 
considerations such a;’s and the corresponding K; . 

If we replace every set S; by the set S; + g (¢ = 1, 2, --- , m), where g « E* 
is a fixed element, the sets ® and &’ are replaced respectively by the sets + 9: 
and &’ + gi, where gi: = (> a) g. Therefore we may assume also that every 


1 
set S; lies at a positive distance from zero. But then evidently we may choose 
constants A; > 0, --- , An > Osuch that all the sets S; = AvS; (¢ = 1, 2, --- , n) 
are disjoint. Hereby 


S=L LYK, 


where K; = \:K; is the regularly convex envelope of the set S; (¢ = 1,2, ---,). 
In view of this we may also suppose without loss of generality that the given 
sets are disjoint. 

It is easy to see that ® DK’. It remains to show that RC K’. Put S = 
S: U So U--- U S, and let e(f) «Qs, (¢ = 1, 2,---,n), be the functions 
defined in (6). Consider now any nonnegative functional p € Q5 satisfying the 
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following conditions 


We state that g = p(S) «&’. In fact, reasoning as above in proving the pre- 
ceding theorem, we can state that the functional p generates n functionals 
p; Bs, such that for every ¢ € Qs 


(8) p(y) = = a; pi(¢) 
and therefore putting g; = pi(S;) eK; (¢ = 1, 2, --- , n), we find 
g = agi R’. 


Take now a go ¢ E* which does not belong to &’. Denote by L the linear 
subspace consisting of all functions ¥(f) € Qs of the form ¥(f) = ciei(f) + --- + 
Cnen(f) + f(x), where x ¢ E and c; (i = 1, 2, --- , m) are arbitrary real numbers. 
In this space let po(W) = cra: + + 

The functional pp) defined over L may be many-valued. We state that there 
exists at least one function 


(9) vo(f) = + +--+ + cnen(f) = 0 (f S) 
such that 
(10) Po(Wo) = go(%o) + ciai + --- + < 0. 


For in the contrary case if a ¥(f) = 0 (f « S) then po(y) = 0 and, in particular, 
v(f) = 0 and then po(y) = 0, that is po is a single-valued (evidently additive) 
functional taking over nonnegative functions ¢(f) «LZ nonnegative values. 
Since furthermore e(f) = e:(f) + --- + en(f), we may apply our Lemma and 
conclude that there exists a nonnegative functional p¢Qs such that p(y) = 
po(¥) for every Ye L. In particular 


(11) = = a 1, 2, n) 
and furthermore go(z) = po(gz) = p(¢z), i.e. go = p(S). But by the above 
go ¢X’, which contradicts our supposition. 

Thus there exists such an element 2 ¢ and numbers c} such that 
(9) and (10) hold. Now observe that (9) is equivalent to the system of the 
following inequalities: f;(ao) + ci = 0 for all fie S; (i = 1, 2,---,n). Multi- 
plying these inequalities respectively by a; > 0 (i = 1, 2, --- , n) and adding 
them up we obtain 
(12) (zo) + + anc, 20 forall feS. 

Therefore comparing (12) and (10) we find 


Sup f(—a») = Sup f(—m) aici + +++ + < go(—20). 
Hence go ¢&. Thus if go ¢ 8’, then go ¢8, i.e. R’ DK. The theorem is proved. 
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Corouuary 1. If Ki, Ke, --- , Kn are bounded regularly convex sets in 
then whatever be the numbers a, , +++ , Gn the set Ky + --- + aK, is also regu- 
larly convex. 

To obtain this corollary we put S; = K,; (¢ = 1, 2, --- ,) in the theorem 
just proved. 

Coroutiary 2. Let K C E* be a regularly convex bounded set and p > 0 an 
arbitrary fixed number. The logical sum K, of all spheres o(f, p) of radius p 
and with center f in K is also regularly convex. 

In fact, K, = K + o(@, p) and both K and o(@, p) are regularly closed. 

We shall now prove the following theorem. 

TuEeorEeM 4. Let K, C E* and Kz. C E* be two disjoint regularly convex 
bounded sets. Then the distance d between K, and Kz is positive, and to every 
d’ < d(d’ > 0) corresponds an x» € E such that 


(13) Sup < f(ao) — d’| x0 |. 


Proor. By corollary I to theorem 3 the set K = K, — Ke is regularly convex. 
As K, and Kz are disjoint, so the element zero, 6 C E, does not belong to K 
and consequently lies at a positive distance from K. Obviously, d is also the 
distance between K, and Ke. 

By corollary 2 to theorem 3 the set Ka is regularly convex and does not con- 
tain the zero 6. Consequently by the definition of regularly convex sets there 
exists an 2) such that 

Sup < 0. 


feKq’ 
Take an fy e K, then o(fo, d’) C Ka ; hence 


Sup = fo(ao) + d’|ao| < 0. 
f €a(fod’) 


Consequently 
(14) Sup f(20) < —d’|x|. 


Obviously (14) is equivalent to (13). 


2. Unbounded regularly convex sets. We now pass to the study of un- 
bounded regularly convex sets in E*. First of all we shall state the following 
theorem, fundamental in the sequel. 

TuroreM 5. In order that an unbounded set K C E* be regularly convex it is 


necessary and sufficient that the intersection of K with every bounded regularly 


convex set be also regularly convex. 
Proor.’ The necessity of the indicated condition is trivial. It remains to 
show its sufficiency, that is if K satisfies this condition, then to any given go ¢ K 


° The main idea of this proof is borrowed from Banach ([2], pp. 120-121). 
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corresponds an 2» € # such that 
Sup (x0) < g(x0). 


Denote by d the distance of g from K. Obviously d > 0. Take any sequence 
=1>& > & <d(d’ >0). We will now form 
by recurrence a sequence {xi} possessing the property: if for some f « E* and 
some natural number n the following conditions 


(5) eK, fla) 
hold, then 

En+i 


To obtain the first element x, of this sequence we consider the set H, of all f 
such that 
d’ 
feK and |f—g|<- 
the set H, being the intersection of K with the sphere o(g, d’/é2) is by the 
assumed condition a regularly convex bounded set. As g lies at a distance 2 d 
from H,, there exists such an 2 that 


Sup f(a) < g(x) — d’|a |. 


Obviously, by this choice of 2; the conditions f eK, f(a:1) 2 g(a) — d’| ax | 
imply |g —f|>d'/ee. Letnow 2, 22, --- , tn be formed so that (15) implies 
(16). Then 2,4; can be obtained as follows: Consider the set H+: of all f ¢ E* 
which satisfy the conditions (15) and belong to the sphere 


(17) lo-sis 
En+2 


If the set H,4, is empty, take for z,4, an arbitrary element different from zero. 
In another case H,4; being the intersection of K and of n + 1 regularly convex 
sets of which one is bounded (namely, the sphere o(g, d’/én42)) is itself regularly 
convex. As the conditions (15) imply (16), the set Hn, lies at a dis- 
tance > d’/en41 from g, and therefore there exists an 2,4; such that 


Sup f(tny) < g(tn41) — | | 
€ Hn+1 En4+1 
Consequently, if 
(18 feK, fle) z g(a) — 21% 


then f ¢Hn41 , which is possible if (17) does not hold, that is |g — f| > d’/enss. 
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Thus the existence of the required sequence {2;}f is proved. It follows from 
the definition of this sequence that if f ¢ K then at least one of the inequalities 
f(xi) S g(xi) — d' | a; \/e: (¢ = 1, 2, --- ) is true, ie. 

(19) Sup ote) = > d’. 


lsi<o 


Putting 


& | (i 1, 2, ), 
we may write the relation (19) in the form: 
(20) Sup {g9(ys) — f(ys)} > 


Denote by K, the set of all sequences {f(y:)}1 , wherefe K. As | y;| = 
we have lim f(y:) = 0 and therefore Ko C (co), where (co) is the space of all 
imo 


sequences —§ = {&;}f converging toward zero with the definition of the norm 
€|| = Sup | & |. 
lsiso 
The inequalities (20) show that the sequence {g(yi)}f € (co) is at a dis- 
tance 2 d’ from Ky. Furthermore, as K is convex, so is Ko. 
Therefore there exists in (c) a hyperplane 


(21) +k = a(S = 1), 
which passes through the point {go(y:)}? , ie. 
(22) a:go(yi) = ao, 


and whose distance from Ky is 2 d’. The last fact can be expressed by a 


suitable choice of the sign of a as follows: 


Putting then 

(24) Xo = 0; Yi 

we obtain from (22) and (23) that 

(25) < a’ (f eK), 


which completes the proof of our theorem. 

TuEoreM 6. In order that a linear subspace F C E* be regularly closed it is 
necessary and sufficient that there exists in F a bounded regularly convex set K 
which has in F interior points (i.e. is in F a convex body). 
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Proor. To show the necessity of the condition, take in F any bounded set S 
containing in F interior points. Form then the regularly convex envelope K 
of S. As the intersection of K and F (if F is regularly closed) must be also 
regularly convex, it coincides with K. Therefore K C F satisfies all the require- 
ments of our theorem. 

Conversely, let F contain a bounded regularly closed body K. Obviously, 
we may suppose that the null element 6 ¢ K, for in the contrary case we bring 
about this situation by a suitable translation of K in F. Let now K’ be any 
bounded regularly closed set. It is clear that for a sufficiently large the 
intersection of K’ and F coincides with the intersection D of K’ and \K. But 
K’ and XK being regularly closed, so is D, which proves our theorem. 


TuroreM 7. If Ki, Ke, --- ,K, are regularly convex sets in E* and if fur- 
thermore all these sets except maybe one are bounded, then the set K = a,K, + --- + 
a,K, is regularly convex whatever the constants a; , dz, +--+ , A, maybe. 

Proor. In the case in which all sets Ki, --- , K, are bounded the theorem 
coincides with corollary I of theorem 3. Let then K,, --- , Kn-1 be bounded 
and K, unbounded (a, ¥ 0). Put 

C; = Sup | f | (¢ = 1,2,---, — 1). 
fe Kj 


Take now any bounded set H C E*: 
C = Sup |f| < ~. 
fed 


Choose a number R > O such that 


n—l 


|an|R— 


Denote by K;, the intersection of K, with the sphere o(@, R) and put K’ = 
+ + The sets Ki,---,Kn+, Kn being regu- 
larly convex and bounded, so is the set K’ and, consequently, the intersection 
K' 1) H. But it is easy to see that by our choice of R we have K N H = 
K'fN H. Thus K * H is regularly convex, which proves our theorem. 

Corottary. Corollary 2 of theorem 3 remains true if K is an unbounded 
regularly convex set. 

Consequently, theorem 4, being based on the two corollaries of theorem 3, 
also admits a generalization, namely: 

THEorEM 8. Theorem 4 remains true if one of the sets Ki , Kz is bounded and 
another is unbounded. 

Observe also that for the case in which K, is unbounded and Ke consists of 
only one element g, this theorem is a consequence of the inequality (25), for it 
follows from (24) that 
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and consequently (25) implies 
Sup f(x) S g(ao) — d’ | |. 


3. Transfinite methods. In this paragraph we will outline another method 
for studying the regularly convex set which is based on some ideas of Banach 
(see [2] pp. 118-126). 

Let 3 be a limit ordinal. Denote by (ps) the linear space of bounded 
transfinite sequences of real numbers X = { 8.}7 of the type 3 (1 Sa < 9), in 
which the operations of addition and multiplication by a scalar are defined as 


follows: 
1°. If X = {ta}, ¥Y = {na}, then X + Y = {fa + ne} 
2°. If X = {&a} and is a scalar, then XX = {réa}. 
Put 
p(X) = lim Ea. 
Obviously, the functional p(X) has the properties: 
P(X + Y) Sp(X)+ PY), pAX) =rAM(X), AZO. 


By the well-known theorem of Banach (see [2] p. 29) there exists an additive 
functional p(X), such that 


a) p(X) S p(X) (X € (p9)), 
and consequently 
b) —p(—X) S p(X) (X € (p9)). 


Put for arbitrary X = {£.}] € (ps) 
Lim = p(X). 
It follows from a) and b) that 
(26) lim Lim lim ({Ea}i (Po)). 


ad 
Furthermore, as p(X) is an additive functional we have 
(27) Lim {NEa + una} = Lim + » Lim qe. 
From now on in this paragraph we shall agree to associate with every limit or- 
dinal # a fixed operation Lim on (ps) to the real axis, which possesses the 
ad 


properties (26) and (27). 
Let now {f2}? be any ordered sequence of the type 3 (1 < a < #), the ele- 
ments of which belong to E* and are bounded in their totality, i.e. 
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Sup | fa | < 
lsacd 


As for every x € E the sequence {f.(x)} is bounded, we may make correspond to 
the sequence { fa}? a functional defined as follows: 


= Lim fa(x) (a E). 


In virtue of (26) and (27) the functional fo(x) is additive and 
| fo(x) | S Sup | fa(x) | Sup |fa|- |x|. 
lsacd lsacd 


Therefore fo « E* and 
(28) | fo | < Sup | fa |. 
lsa<cd 


Modifying slightly Banach’s definition ({2], p. 119), we shall call the functional 
fy transfinite limit of the sequence {f.} and we shall write 
(29) fh = lim fa. 
Now it is natural to say that a set K C E* is transfinitely closed if it contains the 
transfinite limits of all ordered bounded sequences {f,} belonging to it. 
THEorEM 9. In order that a set K C E* be regularly convex it is necessary and 
sufficient that it be transfinitely closed. 
Proor. The condition is necessary. In fact, if a bounded ordered sequence 
\fa} (1 S a < #) belongs to the regularly convex set K, then the functional 


fo = Lim f. 
ad 


also belongs to K for in the contrary case there exists an 2 « E* such that 


Sup fa(2o) Sup f(x) < fo(zo), 
lsacd 


which is incompatible with (26). 

In proving the sufficiency of our condition we may suppose without loss of 
generality that K is bounded. In fact, if K is transfinitely closed then by (28) 
80 is the intersection of K” with the sphere o(@, n). On the other hand if we 
show that K” is regularly closed, then so will K be, by theorem 5. 

Thus suppose K bounded and transfinitely closed. Denote by K’ the regu- 
larly convex envelope of K. To prove our theorem it remains to show that 
K'= K. Take a geK’. Let us well order the elements of FE (E = {za}, 
(a < #)). To prove that g eK we show that to every 3; < 0 corresponds at 
least one f ¢ K such that 


(30) f(ta) = (a < 
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Let first 3; = n + 1 be an integer. Consider the set &, C R, (R, is n-dimen- 


sional Euclidean space) of all points (é, --- , &), where re 
It is easily seen that &, is convex and closed (the latter in virtue of the trans- its 
finite closedness of K). We state that the point sp 
i) belongs to ®,. In fact, in the contrary case there exists by a theorem of we 
‘ Minkowski a system of numbers ai, --- , a, such that 
Sup pm < in OG Ni » in 
1 1 the 
a | which by (31) and (32) may be written as follows co 
n n 3: 
1 1 K 
Brie i | Since g¢K’ this inequality contradicts corollary 1 of theorem 1. Thus ths 
(m,--+ mn) €R,, i.e. ford; = n + 1 there exists a f K such that (30) holds. 
wiki) | Now we can apply the transfinite induction. Indeed let the limit ordinal ( 
| 3, have the property: to every < # corresponds a fg K, such that = pre 
Al Mel | g(ta) (a < B). Then K being transfinitely closed the functional the 
= Li 
Fo, Ie 
| belongs to K and, furthermore, as it satisfies the conditions fs,(%a) = g(Xa) seq 


. (a < 0), the possibility of applying the transfinite induction is established. 
| This completes the proof of our theorem. 
The reader who is familiar with the book of Banach has probably remarked 


that theorem 9 presents a generalization of Banach’s theorem concerning trans- De 
finitely closed subspaces ({2], Lemmas 2, 3, pp. 119-122). Our method of eas 
proving theorem 9 is slightly different from that of Banach, though it is possible I 
also to prove this theorem following his methods. {fn 


It is easy to see that theorem 7 may be obtained as an almost immediate 
corollary of theorem 9. However, we should meet some difficulties in this 
deduction of the theorem if we had defined transfinitely closed sets exactly as 


Banach defined them. bela 

We also find it interesting that all the main properties of regularly convex fun 

sets may be obtained as has been shown in the preceding paragraphs without E 

resorting to transfinite limits. “i 

the | 

4. Separable sets. The criterion that a set K C E* is regularly convex !s 8 

ie much simplified in the cases in which the space E or the set K itself are sepa- o 
rable. We begin with 


‘ 
af 
3 
ag 
j 
: 
3 
} 
3 th 
he 


A- 


REGULARLY CONVEX SETS 571 


TueorEM 10. Let E be a separable Banach space. Then a set K C E* is 
reguiarly convex if and only if tt is convex and weakly closed (as a set of linear 
functionals)” 

Proor. The necessity of the indicated condition being trivial, we shall prove 
its sufficiency. Thus let K ¢ Z* be a convex and weakly closed set. As the 
sphere ¢, = o(8, n) is also convex and weakly closed, so is the intersection, 
K,, of K and o,. Now if we prove that K, is regularly convex, then so will 
be the set K (by theorem 5). Therefore in proving that K is regularly convex, 
we may suppose at once that K is bounded. 

Let g « K’, where K’ is the regularly convex envelope of the set K. Our 
theorem will be proved if we show that ge K. Let {x,} be a dense sequence 
in E. Since E is separable, K is weakly compact. Therefore we may reason 
the same way as in proving theorem 9 and show that to every n = 1, 2, --- 
corresponds a f, ¢ K such that 


(33) fn(Xi) = 1, ,n). 


K being weakly compact, there exists a subsequence {f,,} and an fo « K such 
that fo(xz) = lim f,,(a) (ce Then in virtue of (33), fo(x:) = = 
1, 2,--- ) and consequently g = foe K. Thus the theorem is proved. 

One can easily obtain theorem 10 from theorem 9 as Banach had done by 
proving his particular case (see footnote 7), but we wished to show how the 
theorem may be proved without transfinite limits. 

Lemma 1. Whatever be the separable subspace F C E*, there exists a separable 
subspace G E such that F G* 

Proor. Let {fm}? be a sequence dense in F. To every f, corresponds a 
sequence {2mn}n—1 such that 


tim = (n = 1,2, +++). 
Denote by G the linear envelope of the set {zmn} (m,n = 1, 2,---). It is 
easy to see that |f|z = |fle if feF. Therefore we may write F C G*. 
Lemma 2, Let S be a separable bounded set in E*. If for each sequence 
\fn| C S the functional 


fo = Lim f, 


belongs to H, then the set H is weakly compact (in the sense of weak convergence of 
functionals).° 
Here we denote by Lim an arbitrary fixed operation (generalized limit) 


"For the particular case when K is a linear subspace of E* one can find this theorem in 
the book of Banach ((2], p. 124). 

*See V. Smulian [15] and [16]. 

* This lemma is taken from a paper of V. Smulian [16], where it is proved in a somewhat 
More general form. 
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corresponding to the first infinite ordinal wantin 3 = w which was defined in 
the preceding paragraph. 

Proor. Take an arbitrary sequence {f,}7 C S. Denote by F the linear 
envelope of the set S. As the set F is separable, there exists by the preceding 
Lemma a separable subspace G C E, such that F C G*. Let {xi}f be a se- 
quence dense inG. It is clear that if f’, « F and f’(xi) = f’’(x,) (¢ = 1,2, ...), 
then f’ = f’. Choose by the diagonal process of Cantor a subsequence {f,,,}?; 
such that there exists lim f,,(z:) for each 7 = 1, 2,---. 


We now assert that lim f,,(x) exists for every x eH. Admit the contrary, 
ie. that for some 2 ¢ E the lim f,,(%) does not exist. Then we can form two 


subsequences {fp,}%-1 and {f,,}5—1 such that there exist two different lim f,, (2) 
and lim f,,(%). On the other hand putting 
= Lim f,,, f” = Lim f;, 


vo 


we obtain that 


(34) = lim fp, (2s) = lim = (x) (¢ = 1, 2,---) 
and : 
(35) f' (to) (x). 


Since by the condition of our Lemma f’ eS C F, f’’ eS C F, the relations 
(34) and (35) are in contradiction. 

We shall say that a set S C E* is a locally weakly compact set if every bounded 
sequence {f,} C S contains a subsequence {f,,} which converges weakly to a 
functional belonging to S. 

THEOREM 11. In order that a separable set K be regularly convex it is necessary 
and sufficient that K be convex and locally weakly compact." 

Proor. To prove the necessity of our condition, consider the intersection 
K,, of K with the sphere o(6, n) (n = 1, 2,---). If K is regularly convex 
then so is K,. Therefore, by theorem 9, K,, satisfies the conditions of lemma 2 
and is consequently weakly compact. Hence K is locally weakly compact. 

To prove the sufficiency of our condition we take an arbitrary g¢K. Con- 
sider the linear envelope F of the set {K, g}. By lemma 1, there exists a 
separable subspace G C E such that F C G* and consequently K C (* and 
g«G*. K being locally weakly compact as a set of E*, it is the same regarded 
as a set of G*. Consequently K is weakly closed as a set of G* and, being 
convex, is by theorem 10 regularly convex as a set of G*. Hence there exists 
an element 2» ¢ G such that 

Sup < g(x), 


which completes our proof. 


10 The particular case of this theorem, wlan K is a linear subspace was already estab- 
lished by V. Smulian [15]. 
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CuHapTer II 


1. Principal properties of factor-spaces. If is a linear subspace of E, we 
denote by E/G the factor group, obtained from E and G if we consider E and G 
asa group and a subgroup, respectively, with respect to the addition operation 
of the elements. The elements of E/G will be denoted by Greek letters &, n, 
So every element « Z/G can be considered as a set of elements x 
such that, if a « &, then the element z belongs to é if and only if x — 2 €G. 

It is easy to see (see Hausdorff [4]) that E/G becomes a Banach space if the 
multiplication of the elements  e E/G by a scalar and the norm | é | are defined 
as follows: 

The product Ag is the element 7 ¢ E/G which contains all elements \ where 
ze; the norm 

= Inf = Inf |x ~ | (x08). 


If F is a linear subspace of Z*, we may consider also the factor space E*/F. 
The elements of the latter we shall denote by Greek letters ¢, y, x, ---. 

Lemma. If G is a closed linear subspace of E, and F" is the set of all elements 
feE* such that 


(36) f(z) =0 (x eG) 
then G is the set of all elements x ¢ E such that, 
(37) f(z) = 0 (feF). 


Conversely, if F is a linear, regularly closed subspace of E*, and G” is the set of 
all elements x € E for which (37) holds, then F is the set of all elements f « E* such 
that (36) holds. 

Proor. To prove the first statement we have to show that if a ¢G@ then 
there exists an fo e F such that fo(ao) # 0. But if 2 ¢G, @ being closed, there 
exists an fo « Z* such that fo(ao) = 1 and fo(z) = OforzeG. As the last condi- 
tion signifies that feF, the statement is proved. The second part of the 
lemma follows from the definition of regularly closed subspace of E*. 

TaEorEM 12. If F is a regularly closed subspace of E*, and G denotes the set 
of all elements x € E such that f(x) = 0 for f « F, then 


(38) F = (E/G)* and G* = E*/F. 


Proor. To prove the first of these equalities let us put for every fF, 
f(é) = f(x), if € = & contains z. Obviously, f(é) is an additive single-valued 
functional defined on E/G. As | |= 
for weG, S |f|e-Inf |z — u| = [fle-|g|. Thus is a linear 
functional defined on H = E/G with the norm 
(38) [fla S|fle- 


" Obviously, F is regularly closed. 
” Obviously, G is closed. 
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Conversely, let denote a linear functional defined on Putting 
f(x) = f(é) if x € — we obtain an additive functional defined on E and satisfying { 


the condition 
(40) | f(z) = 0 (v 
As | f(x) | = | S lal |S | weg), fe and 
Further, by (40) and the Lemma feF. Also every functional f (Z/G)* gen- 


erates a functional denoted by the same letter f « F and conversely. 
Comparing (39) and (41), we see that |f|z = |f |, which completes the 


4 proof of the first equality (38). t 
Hl Now let fo(x) be a linear functional defined on G(fo « G*). Let g» denote the . 
ag set of all f « Z* such that f(x) = fo(x) for zeG. According to the theorem of 
Hahn-Banach the set gp is not empty and moreover contains at least one element p 
f’ such that | f’ = !fle. If go ego, then g € go (g E*) if and only if g(x) = 


\ . go(x) for x eG, i.e. g — goe F. Hence we can consider the set gp as an element 
i of E*/F. Obviously f 


] 
| |= Inf [fle le = Lolo. 
H € 


i 4 Thus every element f ¢ G* generates an element ¢ = g; ¢ E*/F such that el 
i t Conversely, given an element go ¢ E*/F, for each pair gi € go and ge € go, we have 
| g(x) = go(x) for x eG. Consequently, go determines an element fo G* such 
that fo(z) = g(x) for x eG and every gego. Since |fole S |g forgeq, 
8} 
| (43) Inf |g le = | vol. 
ge 
Thus every element ¢ « E*/F generates an element f ¢ G* such that g = g, , and th 
(43) holds. Comparing (42) and (43) we see that | f |¢ = | g, |, which completes qu 
the proof of the second part of the theorem. 80 
In view of the Lemma we can give another form to our theorem. {a 
THEoREM 12’. If G is a linear closed subspace of E and F is the set of all func- {fg 
tionals f e'E* such that f(x) = 0 for x eG, then G* = E/F and F = (E/G)*. 
Using the notations of the last theorem, let us now consider the spaces E** th 
and G** conjugate to the spaces E* and G*, respectively. Let L be the set me 
F of all elements p ¢ H** such that p(f) = 0 forf¢F. Then according to the last ~ 
fr theorem L is the space conjugate to the space E*/F, which in turn is the conju- 
gate space of G. Thus G** = L C E**, We obtain thus 
I THEorEM 13. If G is a linear subspace of E, then G** may be regarded as a ' 
subspace of E**. if | 
2. 1. The Banach space Z is said to be regular if to every p « E** corresponds prc 


an E such that p(f) = f(x) for A. J. Plessner has remarked that rey 


if | 
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every linear closed subspace of a regular space is also regular.” This proposi- 
tion is an immediate consequence of the proof of theorem 13. The following 
theorem complements the proposition of A. J. Plessner. 

TueorEM 14. Let G be a linear closed subspace of E. In order that E be 
regular it is necessary and sufficient that G and E/G be regular. 

Proor. Let E be regular. Then, as we already know, G is regular. We 
have to show that the factor space F/G is also regular. Keeping the notations 
of the preceding paragraph we may state that the space conjugate to F/G is a 
subspace of the regular space E* and consequently is regular. But if F = (E/@)* 
is regular, then so is B/G." 

Conversely, let G and E/G be regular. Let po « H**. Considering the func- 
tional po(f) on F = (E/G)*, we find in view of the regularity of E/G that there 
exists a £, € E/G, such that po(f) = f(t.) = f(ao) for fe F. 

On the other hand putting pi(f) = p(f) — f(xo) for f « H*, we obtain that 
p(f) = OforfeF. Therefore, considering p; as a linear functional defined on 
E*/F = G*, we find that p «G**. Since G is regular, there exists a x « G such 
that pi(ys) = for gy H*/F or pi(f) = f(a) for fe H*. Thus p(f) = 
f(t. + 21) = f(x) for f « E*, which completes the proof of the theorem. 


2. A set Mt of a Banach-space EF will be said to be weakly compact if every 
sequence {zn}? C M contains a subsequence {z,,}>-1 which converges fo an 
element of E. 

In proving the next theorem we shall rely upon two propositions:” 

A) The unit sphere of a regular Banach space is weakly compact. 

B) The unit sphere of a Banach space E is weakly compact if and only if the 
unit sphere of the conjugate space is weakly compact. 

THEOREM 15. Let G be a linear, closed subspace of E. In order that the unit 
sphere of the space E be weakly compact it is necessary and sufficient that the unit 
spheres of the spaces G and E/G be weakly compact. . 

Proor. If the unit sphere of G is weakly compact then it is edsy to see that 
the unit sphere of G is weakly compact. Furthermore, given a bounded se- 
quence {&,}r C E/G (| &&| < p,n = 1, 2,--+ ), we may choose such 2, € 
so that also |z,| < p(n = 1, 2,---). But then there exists a subsequence 
\tn,}om1 that converges weakly to an 2% ¢ E and this implies that the sequence 
Converges weakly to . 

Conversely, let the unit spheres of G and E/G be weakly compact. In proving 
that the unit sphere of E is weakly compact it is sufficient to prove the state- 
ment in the special case when the space E/G is separable. In fact, given a 


'’ This fact is an immediate consequence of the criterion for the regularity of the Banach 
space which was also indicated by Plessner. A simple proof of this criterion was proposed 
by V. Gantmacher and V. Smulian [3]. 
a This follows from the proposition of Plessner: A Banach space is regular if and only 
if its conjugate space is regular ({8], p. 115). 

* These propositions have been proved by V. Gantmacher and V. Smulian [3]. The 
Proposition A), as well as the proposition of Plessner indicated above, have been recently 
repeated by Pettis [11]. 
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bounded sequence {z,}7 C EH, we may form the linear, closed envelope EF, of 
the set {G, a1, t%2,---}. Then £,/G, is separable and being a subspace of 
E/G has a weakly compact unit sphere. Therefore if our theorem is established 
for the case when the factor-space is separable, the unit sphere of EF, will be 
weakly compact and consequently the given sequence, {z,} C E,, will contain 
a subsequence {2,,}>—1 which will converge weakly to an 2 ¢ H, C F. 

Thus we suppose without loss of generality that the factor-space E/G is 
separable and consequently regular. On the other hand, to prove that the unit 
sphere of E is weakly compact it is sufficiently by B) to show that the unit 
sphere of E* is weakly compact. To this purpose let us consider the space 
F = (E/G)* C E* and E*/F = G*. The space F is regular, and by A) the 
unit spheres of F and E*/F are weakly compact. 

But as above by proving that #* possesses the required property we may 
suppose that #*/F is separable and consequently regular. Then, according to 
theorem 14, #* is regular and consequently its unit sphere is by A) weakly 
compact, which completes the proof of the theorem. 


3. Linear envelopes of regularly convex sets. The results of this paragraph 
complete in some respect those of the preceding paragraph. 

Let F be the linear envelope of a bounded, regularly convex set K C E*. 
Denote by G the set of all elements x such that f(x) = 0 for f eK, and by (E/G)x 
the factor group E/G with the new definition of the norm, namely 


(44) lx = Sup [f(e) | (eB /6), 
where x is an arbitrary element belonging to &. The set K being bounded, put 
C = Sup |f|. 

Then from (44) 
(45) Cla}. 


Moreover since for every u eG 
= Sup [f(r u) | $ Cla ul, 


we find also that 
C Inf |x — u| =Clé|. 


For every fe F: f(x) = f(y), if only x — y €G; therefore we can define f(£) as 
equal to f(x), where x¢e& Obviously, f(é) is an additive, single-valued func- 
tional defined over (E/G)x. Moreover f(£) is continuous over (E/G@)x. In 
fact, every f « F admits the representation f(g) = const. (fi(é) — fe(é)), where 
fie K (i = 1,2). Since, in virtue of (44), | fi(é) | S | E |x ( = 1, 2), we see that 
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(46) \f |x = Sup < 
Conversely, let fo(€) denote a linear functional defined over (E/G)x. Putting 


f(z) = folé) if x ¢& we obtain an additive functional defined over E for which 
by (45) 
(47) | | = | folé) | S | folelElx SC | fo |x| z|. 


Thus fy is a linear functional belonging to E*. We will show that moreover 
fveF. To this purpose we denote by K* the convex envelope of the sum of 
the set K and the symmetric set —K. By corollary 1 of theorem 2, K* is regu- 
larly closed. Obviously, 


(48) |Elx = Sup f(z) (veé). 


In proving that fo ¢ F it is sufficient to show that each fi = Afoe K* if |A| S 
1/|folx. Admitting for some fi the contrary, there will exist an 2 ¢«# such 
that 


(49) Sup (zo) < filwo) = filfo) | & |x (2x0 € £0). 


But this contradicts (48). Thus fi¢K*. Observe also that, conversely, if 
feK, then |f|x = 1, for according to (44) | f(é)| S |&|x. Denote by Fx 
the vector space F in which is introduced the norm | f |x defined by (46). Then 
we may formulate the result obtained as follows: 

TurorEeM 16. The Banach space Fx is the conjugate space to the space (E/G)x , 
furthermore, the set K* is its unit sphere. 

The space Fx being a conjugate space is complete. On the other hand it 
follows from (45) that |f| S$ C|f\x (fe F). Therefore in virtue of a well- 
known theorem of Banach (see [2], p. 41) if F is closed the two norms | f | and 
J |x define the same topology in F, which implies the existence of a constant C; 
such that | f |x S Ci|f| (fF). Consequently in this case the intersection R 
of the sphere o(6, 1) C E* with F coincides with the intersection of the sphere 
o(6, 1) with the set C,:K. The last two sets being regularly convex, so is the 
set R. Recalling theorem 6 we arrive at 

THEorEM 17. If the linear envelope F of a bounded, regularly convex set K C E* 
is closed, then it is regularly closed. 

Note also the following 

THroreM 18. If K C E* is a bounded regularly convex, separable set, then 
the factor-space (E/G)x is separable. 

Proor. Evidently K* is also separable and being bounded and regularly 
closed is by theorem 11 weakly compact as a set of functionals. Then by 
theorem 22, which we shall prove in the sequel, the set K* is a weakly con- 
‘nuous image of a compact metric set. But K* being the unit sphere of the 
conjugate space to the space (E/G)x , the aforesaid property of K* implies (by 
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bounded sequence {z,}7 C EH, we may form the linear, closed envelope F, of 
the set {G, 11, 2,--- }. Then £,/G, is separable and being a subspace of 
E/G has a weakly compact unit sphere. Therefore if our theorem is established 
for the case when the factor-space is separable, the unit sphere of EZ; will be 
weakly compact and consequently the given sequence, {z,} C E,, will contain 
a subsequence {2,,}>-1 which will converge weakly to an % € HE, C F. 

Thus we suppose without loss of generality that the factor-space E/G is 
separable and consequently regular. On the other hand, to prove that the unit 
sphere of E is weakly compact it is sufficiently by B) to show that the unit 
sphere of E* is weakly compact. To this purpose let us consider the space 
F = (E/G)* Cc E* and E*/F = G*. The space F is regular, and by A) the 
unit spheres of F and E*/F are weakly compact. 

But as above by proving that EH* possesses the required property we may 
suppose that H*/F is separable and consequently regular. Then, according to 
theorem 14, E* is regular and consequently its unit sphere is by A) weakly 
compact, which completes the proof of the theorem. 


3. Linear envelopes of regularly convex sets. The results of this paragraph 
complete in some respect those of the preceding paragraph. 

Let F be the linear envelope of a bounded, regularly convex set K C E*. 
Denote by G the set of all elements x such that f(x) = 0 for f «eK, and by (E/G)x 
the factor group E/G with the new definition of the norm, namely 


(44) = Sup |f(@) | /G), 
where zx is an arbitrary element belonging to & The set K being bounded, put 
C = Sup |f|. 

Sek 

Then from (44) 
(45) 


Moreover since for every u ¢€G 


= Sup lf@@—u)| Clix —u|, 


we find also that 
< C Inf |x — w| = Clé|. 


For every fe F': f(x) = f(y), if only x — y eG; therefore we can define f(£) as 
equal to f(x), where x e& Obviously, f(£) is an additive, single-valued func- 
tional defined over (E/G)x. Moreover f(g) is continuous over (E/@)x. In 
fact, every f « F admits the representation f(é) = const. (fi(é) — fo(é)), where 
fie K (i = 1,2). Since, in virtue of (44), | fe(é) | S | E |x (¢ = 1, 2), we see that 
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(é) 
(46) lf lx = Sup < ~, 


Conversely, let fo(€) denote a linear functional defined over (E/G)x. Putting 
f(t) = fol) if 2 ¢& we obtain an additive functional defined over E for which 


by (45) 
(47) | | = | folé) | | folelElx SC | fo |x| x}. 


Thus fy is a linear functional belonging to E*. We will show that moreover 
feF. To this purpose we denote by K* the convex envelope of the sum of 
the set K and the symmetric set —K. By corollary 1 of theorem 2, K* is regu- 
larly closed. Obviously, 


(48) = Sup (ree). 


In proving that fo ¢ F it is sufficient to show that each fi = Afoe K* if |X| S 
1/|folx. Admitting for some f; the contrary, there will exist an 2 ¢ Z such 
that 


(49) Sup (zo) < filwo) = filéo) | |x (xo € £0). 


But this contradicts (48). Thus f;eK*. Observe also that, conversely, if 
feK, then |f |x = 1, for according to (44) | f(é) | = |&|x. Denote by Fx 
the vector space F in which is introduced the norm | f |x defined by (46). Then 
we may formulate the result obtained as follows: 

TuroreM 16. The Banach space Fx is the conjugate space to the space (E/G)x , 
furthermore, the set K* is its unit sphere. 

The space Fx being a conjugate space is complete. On the other hand it 
follows from (45) that |f| < C|f\x (f¢F). Therefore in virtue of a well- 
known theorem of Banach (see [2], p. 41) if F is closed the two norms | f | and 
J \x define the same topology in F, which implies the existence of a constant C; 
such that | f |x S Ci|f| (fF). Consequently in this case the intersection R 
of the sphere o(@, 1) © E* with F coincides with the intersection of the sphere 
o(8, 1) with the set C,}K. The last two sets being regularly convex, so is the 
set R. Recalling theorem 6 we arrive at 

Tuxorem 17. If the linear envelope F of a bounded, regularly convex set K C E* 
is closed, then it is regularly closed. 

Note also the following 

Turorem 18. If K C E* is a bounded regularly convex, separable set, then 
the factor-space (E/G@)x is separable. 

Proor. Evidently K* is also separable and being bounded and regularly 
closed is by theorem 11 weakly compact as a set of functionals. Then by 
theorem 22, which we shall prove in the sequel, the set K* is a weakly con- 
tinuous image of a compact metric set. But K* being the unit sphere of the 
conjugate space to the space (E/G)x , the aforesaid property of K* implies (by 
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the Lemma which will be proved at the beginning of the following paragraph) 
the separability of the space (E/G)x . 


CuapTER III 


1. Weakly compact sets in the conjugate space. In this paragraph we use 
the notions weakly convergent, weakly compact, etc. for sequences, sets, etc. of 
elements from E* in the sense which is usually adopted for functionals. To 
avoid a misunderstanding we will say also that a set S C E* is weakly compact 
in E* if every sequence {f,}r C S contains a subsequence {f,,} converging 
weakly to an element fo « H*; if furthermore, whatever be the sequence {f,|, 
the corresponding elements fo all belong to S, then S is simply called weakly 
compact (or weakly compact in itself). 

We first prove some Lemmas: 

Lemma 1. In order that a Banach space E be separable it is necessary and 
sufficient that the unit sphere o = o(8, 1) of the conjugate space E* be a weakly 
continuous image of a compact meiric set. 

Proor. If E is separable, then there exists a sequence {z,}f dense in the 
unit sphere || 2 || < 1. We introduce a new distance o(g, f) between two ele- 
ments g ¢ E* and f « E* as follows: 


l<n<o n 
As E is separable, o is weakly compact. It is easy to see that o becomes by this 
new metric a compact metric set, in which the metric convergence coincides 
with the originally weak convergence. Thus o can be looked at as a weakly 
continuous one-to-one image of a compact metric set. 

Let now conversely o be a weakly continuous image of a compact metric 
set 7. Then a is the range of a weakly continuous function f; defined over 7’ 
(te 7). To every x ¢ E we make correspond a continuous function ¢,(t) (t € 7) 
as follows: g(t) = f:(x) (te 7). Thus the space E is isometrically transformed 
into a linear subspace of the space [7] consisting of all continuous functions 
g(t) with the usual definition of the norm: || ¢ || = Sup | ¢(é) |. But the space 

teT 


[7] is separable, and, consequently, so is E. 

Given a set S C E* we denote by S the weak closure of S, i.e. the totality 
of all elements of S and of all their weak limits. 

Lemma 2. If E is separable and S GE is bounded, then the weak closure 8 
of S is weakly compact and consequently wéakly closed. 

Proor. Take in E* a sphere o(6, r), which contains S. In proving the pre- 
ceding Lemma we have shown, E being separable, that « can be regarded as a 
metric compact set in which the metric convergence coincides with the weak 
convergence. But then S is the metric closure of S in o and, consequently, 1s 
metrically closed and also weakly closed. 

From theorem 10 and Lemma 2 follows 
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TurorEM 19. If E is separable and S Cc E* is bounded, then the weak closure 
Sconv. Of the convex envelope Seonv. of S coincides with the regularly convex envelope 
K of 8. 

> Since K is convex and regularly closed (and consequently weakly 
closed) we have Sconw. © K. On the other hand it is evident that Sconv. is 
convex and, being by lemma 2 weakly closed, it is by theorem 10 regularly 
convex. Hence KC & conv. + 

It is now easy to prove the : 

TurorEM 20. Let S C E* be a weakly continuous image of a compact metric 
set T. The weak closure Seonv. of the convex envelope Seonv. of S coincides with 
the regularly convex envelope K of S. Furthermore, the set Sconv. itself is a weakly 
continuous one-to-one image of a compact metric set. 

Proor. Obviously S is a bounded set and therefore K is bounded. Starting 
from K, introduce in the same way as in Chap. II, §3 the linear subspace G C E, 
the factor-space (Z/G)x , and the conjugate space Fx = (E/G)x. The set S 
can be regarded as a set in E* or asa set in Fx. But it is easy to see that in 
both cases the set Sconv. remains the same. Therefore, recalling theorem 19, 
we see that our theorem will be proved if we show that the space (E/G@)x is 
separable. To this end we reason in like manner as in proving Lemma 1. By 
the condition of our theorem there exists a weakly continuous function f;, , 
(te 7’), the range of which coincides with S. Make correspond to every 
te(E/G)x the function ¢g;(t) defined as follows: ¢:(t) = fi(x) (te 7), where 
zeé. As in virtue of corollary I of theorem I 


(51) = Sup | = Sup lf(x) | = Sup | = Sup | ge(t) |, 


the correspondence £ <> g between (H/G@)x and the subspace of the space (T] 
consisting of all continuous functions g(t) defined over T is a linear, isometric 
one. But the space [7'] being separable, so is the space (E/G)x , which com- 
pletes the proof of our theorem. 

TurorEM 21. Let S C E* be weakly compact in E*. If S is separable, then 
iis a weakly continuous one-to-one image of a compact metric set. 

Proor. Let F be the linear envelope of S. Evidently, F is separable and 
consequently, by Lemma I of Ch. I, §4, there exists a separable subspace G C E 
such that F C G*. On the other hand, if G is separable, we can apply Theorem 
19 to the set S. Thus we obtain that S, considered as a subset of G*, is a 
weakly continuous one-to-one image of a compact metric set 7’. 

Denote by f; the image of te 7 in S; then t, — t implies 


(52) 


for every x €G. Our theorem will be proved if we show that (52) holds for 
every xe H. Admit the contrary, i.e. that there exists an 2 « # such that 


(53) lim sup — | > 0. 


use 
. of 

To 
fa, 
ukly 

the 
ele- 
his 
kly 
tric 
r T 

T) 
1ed 
ons 
ity 

sa 
ak 
, is 


| 
4 
| 
4 


580 M. KREIN AND V. SMULIAN 


As S is weakly dense in S, the antecedent of S in T is dense in T. Therefore 
there exist points t, ¢ 7’ (n = 1, 2, --- ) such that 


<>, (n = 1,2, 


On the other hand, S being weakly compact there exists a subsequence {f,; | 
converging weakly to an fo e S as a set of functionals of Z*, that is lim f,; (x) = 


fo(x) (x ¢ E). As for every x eG the function f,(z) is continuous on the com- 
pact 7’ and, consequently, uniformly continuous on 7’, we have also that 


(54) lim fr,,(%) = folx) (x eG). 


From (52) and (54) we conclude that fo(x) = fi.(z) for every «eG and, 
consequently, 


(55) fo(x) = (x 


for |f’ =|f' |e, ff’, « F. Comparing (53), (54) and (55) we 
come to a contradiction. Thus the theorem is proved. 

THEOREM 22. Let S C E* be weakly compact in E*. If 8S is separable, then 
Sconv. coincides with the regularly convex envelope K of S. Furthermore, Seon. 
is a weakly continuous one-to-one image of a compact metric set. 

Proor. As K is weakly closed the set 8S’ = SC K. In virtue of theorem 21 
we can apply to S’ theorem 20, thus we obtain that K = Stony. is a weakly con- 
tinuous image of a compact metric set. Since furthermore the set Stony. coin- 
cides in this case with the set Scony. the theorem is proved. 


2. Weakly compact sets in H. Every element x «EF generates an element 
X ¢ H** (E** denotes the conjugate space to the space #*) by the formula 


X(f) = f(x) (fe). 


This element X will be denoted by 2**. 

If a sequence {2,}f converges weakly to an element 2» , then the corresponding 
sequence {x, }f converges weakly (as a set of functionals defined over E*) 
to the element x;". This permits us to deduce from the theorems which have 
been proved in the preceding paragraph some properties of weakly compact 
sets in the space Z. The weak compactness of sets in E is understood here in 
the same sense as in §2, i.e. a set G C E will be said to be weakly compact in E 
if every sequence {x,}1 C G@ contains a subsequence {z,,} which converges 
weakly to an element 2» ¢ E. 

Thus according to this definition if a set GC EF is weakly compact in E the 
corresponding set G** C E** is weakly compact in E** as a set of functionals 
defined over E*, the converse in general not being true. 
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TuroreM 23. If GC E is a weakly compact set in E its weak closure G (i.e. 
the totality of all elements of G and their weak limits) is weakly compact and weakly 


closed. 
Proor. Take an arbitrary sequence {zm} C G. Choose in G for every x» 
a sequence {Zmn}n=1, Which converges weakly to x, (n = 1, 2,---). Denote 


by E; the linear, closed envelope of the set G. = {mn} (m,n = 1,2,---). As 
every closed linear subspace of E is weakly closed, we have G; C EF; and there- 
fore G, is separable. Considering now G, and G, as sets in E** and applying 
to them theorem 21 we conclude that G, is a weakly continuous image of a 
compact metric set and consequently is weakly compact and weakly closed. 
Hence the sequence {z,} C G; contains a subsequence {z,,} converging weakly 
to an element 2 € G; C G, which proves our theorem. 

TueorEM 24. If a set GC E is weakly compact in E, so is the convex envelope 
K of G. 

Proor. Let {a,}7 GC K. Then every z, (n = 1, 2,---) is a limit of a 
sequence of points z of the form z = , Where y; €G, > = 1,2, --- ) 
and )} 4; = 1. Therefore there exists a countable set G, C @ such that its 
convex envelope contains the sequence {z,}~. The linear, closed envelope FE; 
of G, is separable and consequently so is the set G, CE,. If we show that the 
convex envelope of G, is weakly compact in E, then we can state that the 
sequence {z,}{ C G, contains a subsequence {z,,} converging weakly to an 
element of Z, which, consequently, proves our theorem. Thus we see that to 
prove our theorem it is sufficient to prove it in the special case when E£ is sepa- 
rable and the set G C E is weakly closed. : 

Consider the set G** C E** of all elements x** ¢ E** corresponding to the 
elements x ¢G. The set G being supposed weakly compact and separable, then 
so is the set G** (considered as a set of functionals defined over E*). Therefore 
we can apply to G** theorem 22 and conclude that H, the regularly convex 
envelope of G**, is weakly compact and weakly closed. Thus to show that the 
convex envelope of G is weakly compact in E£ it is sufficient to verify that every 
element of H is generated by an element of E. 

Let Xo ¢ H; then by theorem I Xo = po(G**), where po € Bos. Consequently, 
for every fe Xo(f) = poly), where g/(x**) = x**(f) = f(x) 
The space E being separable, in order to prove that there exists an % ¢ E such 


. that X) = x**, ie. such that Xo(f) = f(2o) (f « E*), it is sufficient to show (see 


Banach [2], p. 131) that the functional Xo(f) (f « E*) is weakly continuous. 
Let a sequence {f,}f « E* converge weakly to an element fy ¢ E*. Then 


(56) lim gy,(X) = (X « G**). 
Let L be the linear space of all weakly continuous functions g(X) (X ¢« G**), 


which we shall consider as a subspace of Qe-+ (see Chap. I, §1). According to 
theorem 21 we may introduce in G** such a metric that G** becomes a compact 
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metric set and L the set of all continuous functions defined over this set. Kyi- 
dently g; L (f « and 


Il = Sup, le(X)| 


where C depends only on G**. 

Consequently, the relation (56) means (see Banach, [2] p. 224) that the 
sequence {gy,} converges weakly to gy, in the sense of weak convergence of ele- 
ments of L and, consequently, of elements of L. But then 


lim polyy,) = 

ie. Xo(fn) — Xo(fo), which completes the proof of our theorem. 

Coroutiary. If a set G C E is weakly compact in E, then its convex closed 
envelope K is weakly compact in itself. 

In fact, K being convex and closed is also weakly closed.’* Consequently 
K is the weak closure of the convex envelope Ki of G. By theorem 24, K;, is 
weakly compact in F and consequently K is weakly closed in itself according 
to theorem 23. 


3. Invariant points of transformations. Generalizing some investigations of 
G. D. Birkhoff and O. D. Kellogg, J. Schauder among others has established the 
following theorem: 

A. Let F(x) be a continuous operation, which transforms a convex, closed set 
H C E into its compact part. Then F(x) has a fixed point, i.e. there exists such 
an xo € H that F(a) = 1%. ‘ 

Using this theorem, J. Schauder obtains, in addition, the theorem 

B. Let E be a separable Banach space and let H C E be a convex, weakly compact 
and weakly closed set. If a weakly continuous operation F(x) defined over H 
transforms the set H into its part, then F(x) has a fixed point xo ¢ H (F(x) = 2). 

The results of the preceding paragraph permit one to obtain from J. Schauder’s 
propositions A and B two generalizations of proposition B. 

THEOREM 25. Let H C E be a convex, closed set. If a weakly continuous 
operation F(x) defined over H transforms H into its separable and weakly compact 
part in E, then F(x) has a fixed point. 

Proor. Let H; be the smallest convex, closed set containing G = F(H). 
By the corollary to theorem 24 the set H, is weakly compact and weakly closed. 
Furthermore, H; , being evidently separable, belongs to a separable space /; . 
On the other hand we have, obviously, F(H,;) C F(H) C Hi. To prove our 
theorem it remains to apply J. Schauder’s proposition B to the operation F(z) 
considered over H,. 

TurorEM 26. Let K C E* be a convex, separable and weakly closed set. If a 


16S. Mazur [10], p. 80. 
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weakly continuous operation F(f) transforms K into its weakly compact part G 
in E*, then F(f) has a fixed point. 

Proor. Let Ki be the smallest convex and weakly closed set containing the 
set G = F(K). By theorem 22 K; is weakly compact, weakly closed, and 
separable. Denote by F the linear envelope of the set Ki. In virtue of Lemma 
Iof Chap. I, §4, we can built a linear, separable space Z, C E such that F; C E? . 
As E, is separable we can introduce a new norm in EY such that every sphere 
o(6, r) « H; and consequently the set K, becomes by the new norm a compact 
metric set in which the convergence in the new sense coincides with the original 
weak convergence (see the proof of the Lemma I, §1). After this our theorem 
becomes an immediate consequence of theorem A of J. Schauder. 
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| SOME PROBLEMS IN THE THEORY OF SINGULAR INTEGRAL 
EQUATIONS O1 
By W. J. 

(Received June 1, 1939) (1 
1. Introduction 
. ; Our present object is to investigate the solutions of the singular integral equations Cl 
(1.1) ole) — K(e, Welw) dy = 
(1.1) [ KG, dy = 0 
b for real values of the parameter d, under the assumption that 
| 1, 
(1.1b) f(x) Le (x on (0, 1)) 
it | and that the real symmetric kernel K(x) is measurable.’ = 
elit | We shall also consider the corresponding integral equation of the first kind. - 
Hei | Hie In the sequel various additional hypotheses with respect to K(x, y) will be (1. 
ca made. The equations (1.1), (1.1a) will be singular, inasmuch as it will be not Tk 
(1.2) [ [ Reward 
should exist. As is well known, the essential features of the Fredholm theory 
of integral equations will hold for (1.1), (1.la) when the integral (1.2) exists the 
(in the Lebesgue sense). (1. 
For \ non-real, as remarked by 7’. Carleman,” the situation is simpler than . 
for \ real. On the other hand, applications of integral equations to dynamics ai 
and quantum mechanics are of particular importance for \ real. This fact 
explains our present purpose. For \ non-real Carleman developed a highly (1. 
important theory of singular integral equations. This theory has been extended 
again for \ non-real, in a work by W. J. Trjitzinsky. q 
We shall denote by S the square (1. 
(1.3) 0 Sz, ysl. 
BS ' Measurability in this paper is in the Lebesgue sense. = 
it * T. Carleman, Sur les équations intégrales singulitres a noyau réel et symétrique (Uppsala, : 
i : 1923]; in the sequel referred to as (C). [An 
a 4 3 W. J. Trijitzinsky, General theory of singular linear integral equations with real kernels $ 
; [Transactions Amer. Math. Soc. (1939), 202-279]. In this paper kernels are considered by | 
ey | . which, while not necessarily of the type considered in (C), are repeated limits of kernels of § 
Carleman type. mal 
pro 
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For n a positive integer let EZ, be the part of S in which 


| K(z, y)| Sn. 
One may define’ K,(«, y) as follows: 
K, (2, y) = K(z, y) (in E,), 
(1.4) K,(z, y) =n (when K(z, y) > n), 
«K(x, y) = —n (when K(z, y) < —n). 


Clearly 
(1.4a) | K,(2, y) | | y) | s | K(a, y) |, lim K,(a, y) y)- 


In the sequel the kernels under consideration will be of one of the following 


three types. 
Type 1.A. The integral 


(1.5) [ 


exists for  # & (v = 1, 2, --- ), where the é, possess a finite number of limiting 
points, included amongst the points 
(1.5a) Im- 


The set (é,) will be taken closed. It is supposed that if one defines symmetric 
K (a, y) (6 > 0) by the relations 


(1.5b) y) = 0 Os <6;» =1,--- m), 
K® y) = K(a, y) 
then the integral 
1 pl 
[ 


exists for allé6 > 0. Moreover, 


(1.7) lim [ [K(m, y) — K(az,y)'dy=0 (1,22 = 


Type 1.B. The integral 
1 
(18) [ 
0 


exists for almost all x (on (0, 1)).° 


‘Cf. T. Carleman, La théorie des équations intégrales singulitres et les applications 
[Annales de l'Institut H. Poincaré 1 (1931), 401-430]; in the sequel referred to as (Ci). 

* A comprehensive theory, for \ non-real, for kernels of Type 1. A has been developed 
by Carleman in the greater part of (C). . 

* Kernels of Type 1.B have been involved in a number of developments due to Carle- 
man; cf. (C,) as well as a number of other papers by Carleman, giving applications to 
problems of dynamics and quantum mechanics. 
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Type 1.C. Nothing is added to the previously made hypothesis (to the effect 
that the symmetric kernel K(z, y) be measurable) except the following indirect 
condition. 

Associated with K(z, y) there exists a linear operator L.(é | h(x)) (€ a param- 
eter) such that the following five conditions hold: 

(1.9) LAé | K(x, y))'C Le (in y); 
(1.9a) | Le(é| Ka(a, y)) | < v(&ly) (in y; | y) independent of n), 
where K,(z, y) is from (1.4); | 


(1.9b) lim LAE | y)) = K(z, y)); 
lim = La(E|f(a)), whenever fn(x) C Le 
(1.9¢) 
and fn(x) —> f(x) (weakly) ;’ 


[ 


for all C 

It is clear that the types of kernels, 1.A, 1.B, 1.C, as described above, are of 
increasing degrees of generality, in the order stated. 

For kernels which are merely measurable to (1.1), (1.1a) there will correspond 
the equations 


(1.10) [ Kale, Welw) dy = f@), 


(1.10a) g(t) K,(2, dy =0 (K,(a, y) from (1.4). 


On the other hand, for kernels of Type 1.A, corresponding to (1.1) and (1.1a) 
we shall also have 


(1.11) oa) KC, Woy) dy = 


(1.11a) g(t) K (a, ye(y) dy = 0 (6 > 0), 


where K(x, y) is from (1.5b). 
Finally, of importance for kernels of Type 1.C will be the equations 


(1.12) | [ | K,(2, y)) oly) dy = | f(x)), 


f(z) weakly, if lim [ fale) de = [ f(z) de. 


* Conditions (1.9), --- , (1.9d) are analogous to those employed in (C; p. 138) in an 
investigation (for \ non-real) of equations whose kernels belong to a variety somewhat 
less general than that involved in Type 1.C. 
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(1.12a) [ LAé | y)) ey) dy = L.(é| f(x), 


where L is an operator satisfying (1.9), --- (1.9d). 
2. A lemma valid for all equations (1.1) 
Suppose K,(z, y) is defined by (1.4). Inasmuch as K,,(z, y) is measurable and 
| Kn(z, y)| Sn, 


it is inferred that the corresponding homogeneous integral equation (1.10a) 
possesses a set of real characteristic values 


(2.1) Any = 1,2,-. 


containing at least one value; moreover, for n fixed, the set (2.1) will have no 
finite limiting values. Let the corresponding characteristic functions, arranged 
as an ortho-normal set, be 


(2.1a) Pn,»(Z) = 1,2,---; C Le); 
thus, 
(2.1b) — Yas [ Kal, dy = 0. 
Form an infinite subsequence 
(2.2) n; (j = 1,2,---3;m < m<---) 


of the set (n) of positive integers. Let G be the set of points represented by the 
numbers 


(2.2a) (j,v = 1,2,---). 


The closure’ G of G may consist of all the points of the real axis in the complex 
\-plane. 

Throughout this paper we proceed under the hypothesis that at least for some 
subsequence (2.2) the above set G does not consist of all the points of the real -axis. 
@ being closed, the set CG” will then be open and, thus, will contain an interval 


Whenever K(z, y) is of Type 1.A, we form an infinite subsequence 
(2.4) 6; (6; > 0;j = 1,2, ---; lim 4; = 0) 


and designate by G the set of points represented by the numbers 
(2.4a) (j,v = 1,2,---), 


°G = @ + limiting points of G. 
"CG = real d-axis — 
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where the d{°" [vy = 1, 2, --- ] form the set of characteristic values for the equa In v 
tion (1.1la) (with 6 = 4,)." The corresponding characteristic functions term 
; (arranged as an ortho-normal set) 
will satisfy 
1 
(2.40) ofa) [ KG, dy = 0. 
0 
Hi, Whenever it is explicitly stated that K(x, y) is of Type 1.A the supposition will Desi 
\ a be made that at least for some subsequence (2.4) the closure G of the set G (referred tion 


to in connection with (2.4a)) will not consist of the whole of the real \-axis; CG will 
certainly contain an open interval 


(2.5) A= —A| <o}. (2.1( 


DEFINITION 2.1. With an interval A, defined by (2.3) or (2.5) (as the case may 
be), let d denote a sub interval, 


Appl 
(2.6) d= {|b —A| Sh} (0 <h <a), 
One | | of A. We then let Sa designate the set of points consisting of the interval d and of 
i ee the points of the \-plane corresponding to all the non-real values i. 
For \ on d In vi 
Mi i : When K(x, y) is of Type 1.A then, for on d, 
: H He Let \ be fixed on d and let ¢,(x) C Le (n = n;) be the solution of (1.10), (2.10 
{ 1 
(2.8) gn(x) — dr i K,(2, y)en(y) dy = f(z), we h 
0 
for this value \. We multiply (2.8) by n(x) dz and integrate, obtaining” 
1 1 pl 1 
(2.8a) [ | = d [ [ da dy + [ de. Cons 
becat 
Before establishing an inequality for the double integral in (2.8a) we shall 
consider the function 
1 
(2.9) ra(a) = Kale, dy. 
0 (2.11 
: 1 Note the statement with respect to (1.6). 


12 The double integral in (2.8a) is real since K,(zx, y) is symmetric; thus, the imaginary 
part of the integral last displayed in (2.8a) is zero. 
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In view of a known theorem, 7,(x) can be expanded in a convergent series in 
terms of the functions (2.la), Thus 

— 1 
(2.9a) = — One 


v=1 An, 


where 
200) ear = dy; = dy. 


Designate by w, the double integral in (2.8a); by (2.9), (2.9a), (2.9b), integra- 
tion term by term being permissible, we have 


= Tn(X)@n(x) dx = [ Pn(X)gn(x) dx 


Applying the Schwarzian inequality to the series last displayed, one obtains 
|’ <> lene 


In view of (2.9b) and of Bessel’s inequality 


(2.10) 


(2.10a) |w, [ | (n = nj). 
On writing 
2106) fur = Sle de 
we have a known relation 
[A ond; n = n;3j,v = 1,2, +--+]. 


Consequently, in view of (2.10a), of (2.7) and of Bessel’s inequality (applicable 
because of (2.10b)) 


< [ lente) Pde 


| 
| 
|| 
4 4 
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Application of (2.11) and of the Schwarzian inequality to (2.8a) will yield 
1 1 4 1 4 


< E + | [ Pac || [ 


Accordingly, for 7 = 1, 2,---, 
2 pl 
|A| 2 


An analogous inequality will hold when K(z, y) is of type 1.A. 

Hence the following Lemma has been established. 

Lemma 2.1. Let symmetric K(x, y) be merely measurable. Let d be an interval 
(supposed to exist) as specified in Definition 2.1. With gn;(x) C Le designating 
solution of (1.10) for a value d fixed on d, we shall have the inequality (2.12) satisfied 
as stated.” If K(x, y) is of Type 1.A and d is an interval (supposed to exist for 
some subsequence 6; ; cf. (2.4)) as described in Definition (2.1), one will have 


(2.12a) [ (x) Pde < E [ \f(a) =1,2,--+], 


where g(x) & Ls is solution of (1.11) (with 6 = 6,) for a value d fixed on d."* 


3. Existence of solutions for non-homogeneous equations 


We recall a theorem of F. Riesz according to which inequalities 
1 
(3.1) [ |h,(x) Pde < M (v = 1,2, ---; M independent of ») 
0 


imply existence of a subsequence {h,;(x)} (v1 < ve < --- ) and of a function 
h(x) so that 


(3.1a) lim h,,(x) = (weakly),’ 
while 
(3.1b) "| A(z) Pde < M. 


With symmetric K(x, y) merely measurable in fixed on d, the above theorem, 
together with inequalities (2.12) of Lemma 2.1, will imply that there exists a 
subsequence {gm;(x)} of {¢,;(x)} and a function g(x) C Le so that 


8 The nj(j = 1, 2, ---) in (2.12) are the numbers (2.2), involved in (2.2a) in connection 
with the definition of G; also see the italics subsequent to (2.2a). 

4 The inequalities (2.12), (2.12a) are analogous to certain inequalities obtained in (C) 
for \ non-real; it is to be noted that in the latter case selection of subsequences (nj) ot 
(6;) is not necessary. The formulas stated in (C) for \ non-real are not valid for \ real. 
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(3.2) lim ¢m;(z) = ¢(@) (weakly) 
7 
and 
s [1+ PUT =o 

Unless additional hypotheses are introduced it is generally out of question 
to examine the possibility of g(x) representing a solution of (1.1). We shall 
accordingly examine the situation when K(z, y) belongs to one of the three 
Types 1.A, 1.B, 1.C. 

When K(z, y) is of Type 1.A application of (2.12a) enables one to infer exist- 


ence of a subsequence {o"?(x)} of {¢°?(x)} such that there exists a function 
for which 


(3.3) lim (x) = o(z) (weakly), 

(3.3a) | o(x) dx < q [q from (3.2a); fixed on d]. 
Now, by definition of g(x) (6 > 0) 

62) =r [ KC, do) dy + 50) (6 = 4). 

According to a known theorem,” if f,(z) and g,(x) C Lz and 

(3.5) [ f(a) Pdx <M 


and f,(z) — f(x) weakly, while lim g,(z) = g(x) (almost everywhere) and 
\g(x) | < y(x) C Le, then 
1 1 
(3.5a) lim fAx)gAx) dx = f(x)g(x) dz. 
v 0 0 
In view of the convergence of the integral (1.5) and since by definition (1.5b) 


| K°? (x, y) | | K(a, y)| C La (in y), 


on taking account of (2.12a) (with 6; replaced by 6;) and of (3.3), in consequence 
of the result [(3.5), (3.5a)] it is inferred that 


(3.6) lim K°? (a, y)e®? (y) dy = K(z, y)e(y) dy 


[A fixed on d; x ¥ &, (ef. statement in connection with (1.5a))]. 


(C; 182-133). 
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By virtue of (3.6) and (3.4) it is deduced that the limit 

60) tim = vl) = K(x, dy +400) 

exists (for z & = 1,2,---). By (3.3) 

(3.6b) lim (x) dx = [ dx = absolutely continuous function. 
On the other hand, 

(3.6c) lim dx = lim 4? (z) de = ole) de, 


inasmuch as the passage to the limit under the integral sign in (3.6c) is justifiable 
by virtue of (3.3), of (2.12a) (with 6; replaced by 6;) and of theorem [(3.5), 
(3.5a)]. By (3.6a) and (3.6c) 


[ v@a = [ 


and, accordingly 
v(x) = g(x) (almost everywhere on (0. 1)). 


This, in view of (3.6a), implies that the function g(x), involved in (3.3), ts a solu- 
tion (almost everywhere) of the equation 


1 
(3.7) => Wel) dy (K(q, y) of Type 1.) 
for fixed on the interval d. More precise results for kernels of Type 1.A will 
be given in Theorem 3.2. 
Suppose now that K(z, y) is of Type 1.B. In place of g”?’(x) we now have 
¢m;(t) (ef. (3.2), (3.2a)). Repeating the argument from (3.3) to (3.7), with 
K°? (x, y) replaced by Kn,(x, y) and noting that 


| Km;(a, y) | S| K(a,y)| C Le (in y; almost all 2) 


(cf. (1.4a) and (1.8)), it is observed that the function g(x) of (3.2) satisfies for 
almost all x (on (0, 1) the equation 


1 
(3.74) (a) =r] K(x, Wel) dy + $0) (K(x, y) of Type 
for d fixed on the interval d. 
If K(z, y) is of type 1.C there exists a linear operator L satisfying, with respect 
to K(x, y), the conditions (1.9), --- (1.9d). Let {gm,;(x)} be the sequence, 
involved in (3.2), for this kernel K(z, y) and for \ fixed ond. Thus 
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(3.8) | om;(x) S$ q° (ef. (3.2a)), lim = g(x) (weakly). 
From (3.8) one obtains 
(3.9)  Le(E|em;(z)) =r LAE | Km, (a, y))om,(y) dy + |f(x)). 


By (1.9¢) and (3.8a) 
on lim Le(E | = | o(2)). 


On noting that (1.9b) and (1.9a) (with n = m,) hold, in consequence of (3.8a) 
it is inferred that theorem [(3.5), (3.5a)] is applicable, yielding the result 


In view of (3.9a) and (3.9b) from (3.9) it follows that g(x) satisfies the equation 
(1.12a). We sum the above results in the following theorem. 

THEoREM 3.1. It is recalled that when K(ax, y) is of Type 1.B or of Type 1.C 
there exists, by hypothesis, a sequence (n;) (2.2), associated with which there is 
on hand an interval A (ef. (2.3) and the text from (2.2) to (2.3)). Similarly, when 
K(x, y) is of Type 1.A there exists, by hypothesis, a sequence (5;) (2.4) and an 
associated interval A (cf. (2.5) and the text from (2.4) to (2.5)). In either case, 
let d denote a sub-interval (2.6) of A and let denote a fixed value in d. 

In the case of Type 1.A, for a subsequence {y°?(x)} of {g°?(a)} we have (3.3) 
and the function p(x) involved in (3.3) will be a solution for the value d in question 
and for almost all x on (0, 1) of the equation (1.1). 

In the case of Type 1.B, for a subsequence {ym;(x)} of {yn;(x)} we have (3.2) 
and g(x) (in (3.2)) will be a solution for the above d and for almost all x on 
(0, 1) of the equation (1.1). 

When K(x, y) is of Type 1.C so that an operator L, satisfying (1.9), --- (1.9d) 
is associated with K(x, y), we have (3.2) for a subsequence {gm;(x)} of {¢n;(x)}; 
the function g(x) involved in (3.2) will be a solution, for the value d in question, of 
the equation (1.12). 

In all cases the solution g(x) (for the above d) will satisfy the inequality 


With symmetric K(z, y) merely measurable we note the relation 
(3.10) = [ y)em;(y) dy + f(x). 
When K(z, y) is of Type 1.A one has 


(3.10a) [ K°? (a, (y) dy + f(a). 
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From (3.10) and (3.10a) with the aid of the Schwarzian inequality it is estab- 
lished that, when K(z, y) is of type 1.B, 


(3.11) | Ki,,(z, y) [ | om; (y) Pay | + |f(x)|, 


(3.11a) | — f(z)| [ Kin y) [ lem; (y) 


and that, when K(z, y) is‘of Type 1.A, we have with 6 = 5; , 


(3.12a) —f(z)| [ K®*(a, y) [ |e (y) Pay, 
(ar) — f(as)] — (x2) — 
3.12b 1 1 
[K° (a1, y) — K® (ae, dy [ |e (y) 
0 0 


In (3.11), (3.12b) 

(3.13) Xi: = upper bound of | \ | for A on the interval d. 

By virtue of (2.12) and (2.12a), as well as in consequence of the relations 

| Km;(x, y) | K(x, y)|C (iny), | K°?(a, y)| S| K(w,y)|C Le (iny), 


valid for the Types 1.B and 1.A, respectively, the following is deduced from 
(3.11), --- (3.12b). 
For the Type 1.B, almost everywhere, 


1 4 
.14) | dal dy] +1900 
1 
(3.14a) | gm, (x) — f(x) | S ug | [ K*(ca, y) av | (q from (3.2a)). 
For the Type 1.A, for 2, 2, 22 ¥ & (v = 1, 2, --- ) and for A on d, 


4 
(3.15) |e"? (2) | < K°(z, y) ay | + |f(z) |, 


(x) — fled] flea 


1 
[ [K(ai, y) — K(2e, dy [j sufficiently great]. 


(3.15b) 
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Thus, for kernels of Types 1.B, 1.A the sequence of functions 
(x) }, (x) 


employed in the construction of the solutions of non-homogeneous equations, 
as specified in Theorem 3.1, are bounded (in absolute value) uniformly with 
respect to \ (A on the interval d) and with respect to j. 

When K(z, y) is of Type 1.A, on noting (1.7) it is observed that the condition 
(3.15b) signifies that the functions of the sequence 


(3.16) (2) J(z) (j 1, 2, ) 


are continuous, for z ¥ & (v = 1, 2, --- ; 2 on (0, 1)) uniformly with respect 
to \ (A on d) and uniformly with respect to 7 (when z is on any closed subset 
of (0, 1) — {&}); the latter property of uniformity in accordance with the 
usual terminology is expressed by saying that the sequence (3.16) is equicon- 
tinuous. Thus, for a fixed value of \ (on d) a subsequence {g“!(x)} can be so 
selected that the limit 


(3.16a) lim (x) = 


exists for all z ¥ & (v = 1,2,---). Now, the functions ¢“(x) are analytic 
in \ for A on Sq (Definition 2.1); on the other hand, the (4;) can be so selected 
that the limit (3.16a) exists not merely for a single value of \ (on d) but exists 
uniformly (with respect to \) for \ on a preassigned denumerable set situated 
on the interval d;’* these considerations lead to the conclusion that the sequence 
{e“?(x)} can be so selected that the limit (3.16a) exists for z ¥ & (v = 1,2, --- ) 
and for all \ in Sg ; moreover, in Sa ¢(x) will be analytic in A. Since ¢g(z), as 
defined by (3.16a), is a particular function referred to in Theorem 3.1, it clearly 
is a solution C Le of the equation (1.1). 

THEOREM 3.2. Let K(x, y) be of Type 1.A. The subsequence {p“i(x)}, in- 
volved in Theorem 3.1 for the purpose of constructing a solution of the equation 
(1.1), can be so selected that the limit 


lim ¢°?(x) = ¢(z) 


exists for all x # & (v = 1, 2,---) and for all d in Sq (Definition 2.1) and so 
that for every fixed x ¥ &, (v = 1, 2, --- ) g(x) is analytic in d for Nin Sa. For 
these values of x and d ¢(x) will constitute a solution C Le of the equation (1.1). 
Moreover, g(x) — f(x) will be continuous in x (for x # & ;v = 1, 2,---) in 
accordance with the inequality 


— flas)] — — Sh [ [K(a:, y) — K(ae, dy 


(v = 1, 2, ---); ef. (1.6)] 


We note that the second members in (3.15), --- (3.15b) are independent of A. Use is 
made of a theorem of Vitali. Compare with (C). 
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where, for d on d, 
h= (cf. (8.13), (3.2a)) 


and, for \ non-real, h is a value given by Carleman.” 

We shall now prove the following theorem. 

THEOREM 3.3. Let K(x, y) be of Type 1.A or of Type 1.B. Ford on an interval 
d a solution g(x) C Le of the equation (1.1) is given by the formula 


1 


ola) = sa) +0 oe, Sw) ay 


where 0(x, y | u) is a spectrum of K(x, 
To establish this theorem suppose first that K(z, y) is of Type 1.B. The 
spectrum of the equation 


=r] Kale, Welw) dy 
will be 
on. (x) on, (y) [10 < An,» <A; ford > 
— on ly) <0; ford 


while @,(x, y | 0) = 0."° In consequence of the classical theory, which is applic- 
able since K,,(x, y) is measurable and | K,(z, y) | S n, the solution of (1.10) C L, 
is expressible in the form of a Stieltjes integral, 


y | d) 


= +0 ay y ay 
= so) +r dy 
(3.18) + [ d, [ sone y dy 
= se) +r Kala, $0) dy 


1 pl 
+» 4 [ [ K(x, ds at” 


17 Cf, (C). 

18 This theorem is analogous to a result given by Carleman, for \ non-real, in (C; pp. 
66-68). 

19 This is in accordance with (C; p. 26). 

*0 (3.18) is analogous to a formula given in (C; p. 67) for the Type 1.A; (3.18) amounts 
to a rewriting in terms of Stieltjes integration of relations which are derived without 
difficulty from the classical theory. 
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Now 
3.18) lim [ dy = K(x, ay, 


since f(y) C Le and since 
K,(z,y) K(a,y), | Knr(z,y)| S| K(x, y)|C le (in y); 


in fact, these conditions enable application of the theorem in connection with 
(3.5), (3.5a). 


We write | 
3.19) C t|u) s)f(t) ds dt = Ci, + 
where 


1 pl 


1 pl 
Let us express Ci, as | ; 
(8.20) = CA) + RA, 
where 
l 1 


(3.20b) = ( + dy [ 6,(s, t | w)K(a, s)f(@ ds dt. 


Keeping \ on the interval d and letting | > | do| + h, it is observed that, 
in R,()), 


1 1 
consequently 
(3.20c) s [ | dt [ K*(z,s)dsSé (e>0) 
~ (L—| do] — h)? 4o 0 


(for 1 = U(e, z)), as can be deduced with the aid of a result in (C; p. 31).” . 


* This result gives an inequality for the variation of the double integral involved in 
(3.20b). In this connection, as well as on several occasions in the sequel it is to be noted 
that the formulas referred to in (C) are applicable not only for kernels of Type 1.A, but 
also for kernels of Type1.B. This is true, of course, not for all results established in (C). 
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Consider now C,,(l) (3.20a), 
(3.21) = An) + BAD, 


where 


(3.21a) = 


+ 4% [ [ On(s, t | K(x, s)f(t) ds dt, 


It is observed that in A;()) 


| 1 


thus, for \ fixed on d the function of yp, 5) is continuous in yu for u on each 
of the closed intervals 
— »), (Xo + 1); 
hence application of a result in (C)” will yield 
lim A,(l) = 


[[ t| K(a, s) f(t) ds dt. 


6(s, ¢| = lim @,(s, t| 4) = spectrum of K(s, t); 


(3.21¢) 


Here 


it is understood that passage to the limit is through a suitable subsequence 
of values n. 

In view of the expression for 6,(s, ¢ | 4) in terms of the characteristic functions, 
from (3.21b) one obtains 


BA) =D! [ dedt 


(3.21d) 
[ K(z, 8) ds 
[n = m;3fn,; = Fourier coefficient of f(s)]. 


In (3.21d) the sum is over those values »v for which the points Ax, are in A. 
Now, by hypothesis, there are no points \,,, in A; hence 


(I) = 0. 


22 (C; formula 45; p. 32). 
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In consequence of (3.20), (3.20c), (3.21) and (3.21e) 
— An | = | Ra | Se (for 1 2 Ue, z)). 


Hence, by (3.21¢), 


lim Cy, = ( + [.) [integrand from (3.21c)] 
(3.22) 


[ [ K(z, s)0(s, t | u) f(é) ds dt. 


We turn our attention now to C’, (3.19b), 


(3.23) Ch = A, + Bi, 
where 
1 pl | 


(3.23b) Ay = ( “ng + | se [integrand as in (3.23a)]. 


It is observed that By is Bi(l) (ef. (3.21b)) with K(a, s) replaced by 
K,(z, s) — K(a, 8); thus, in place of (3.21d) one obtains 


Be =D — KG, enol ds, 


it, 
where the summation symbol has the same significance as in (3.21d). Hence : 
(3.23¢) B = Q. 
With reference to (3.23b) it is noted that, inasmuch as ) is on d, 
ce 1 
= w—h’ 
accordingly, 
| | 1, Var. [ [ t)[Kn(a, 8) — K(x, s)] f(t) ds dt. 
Whence, in view of (3.23) and (3.23c), an application of (C; p. 31) will yield 
1 1 
I. which implies that 
. (3.24) lim C; = 0. 


By virtue of (3.19), (3.24) and (3.22) it can now be asserted that 


(3.25) lim Ca; = a d, [ [ K(z, 8)0(s, t| u)f() dsdt = C 


| 
| 
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for a suitable subsequence (m;) and for \ on d. Thus, on taking account of 
(3.18a) and of (3.18), it is inferred that for almost all x (on 0, 1)), K(z, y) being 
of Type 1.B, we have 


tim = (2) = fle) +2 [ wf) dy 


(3.26) 


+2? [ [ K(a, 8)0(s, dsdt 


[A on d; 0(s, t|4) = spectrum of K(s, t) corresponding to (m,)]. 


The function g(x) so defined will C Le and will be a solution of (1.1) for almost 
all x on (0, 1). 

With d on d, it is possible to show that (3.26) implies (3.17). A similar result 
is established, following the procedure given above, when K(z, y) is of Type 1.A. 
Thus, the theorem is seen to hold as stated. 

Suppose now that K(z, y) is of Type 1.C, L denoting a corresponding linear 
operator. With 6,(z, y | A) defined as stated subsequent to Theorem 3.3, form 
the function 


(3.27) ' y dx dy = On(x, y 


A subsequence m; of the sequence (2.2) (involved in the definition of () can be 
found so that the limit 


(3.27a) lim Qn y|A) = Q(a, y | d) 


exists and possesses most of the essential properties enjoyed by the function so 
denoted in (C)—in Carleman’s treatment of kernels of the kind somewhat less 
general than Type 1.C.¥ Following the procedure given in (C) (Chapter IV) 
it can be shown (for the kernels of Type 1.C) that, corresponding to every limiting 
function Q(x, y |) (3.27a), the equation 


(3.28) e(x)) — LAE | K(x, y))e(y) dy = L.(é|f(a)) 


will possess a solution C Le of the form 


(3.28) = se) +02 [ so Zale, 


(almost everywhere on (0, 1)), provided d is non-real. 

An analogue to Theorem 3.3 will be as follows. 

THEOREM 3.4. With K(z, y) of Type 1.C and X on an interval d, a solution 
g(x) C Le of the equation (3.28) is given by the formula (3.28a), where Q(x, y | #) 
is a function of the form (3.27a). 


2 Cf. Chapter IV of (C). Also see Triitzinsky, loc. cit. 
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To establish this result we first note that in consequence of the classical 
formula for solution gm,;(x), C Le, of (1.10) (with n = m;) we have 


320) ome) +92 sy) 200,02, vlad ay 


where Qn,;(z, y |) is defined by (3.27). In view of the preceding, the m;, 
involved in (3.27a), may be selected so that 


(3.29a) [ lym; (x) dx (x) (weakly). 
We also shall have 
(3.29b) | af Lig | Kn; (a, Y)) em; (y) dy = LAE | f(z)). 


The function g(x), referred to in (3.29a), will be a solution of (3.28). Accord- 
ingly, it is observed that the theorem will be established as soon as it is shown 
that 


(3.30) =[. dy h(z, 
where 
a 
hie, w) = $0) dy, 
(3.30a) 


1 
nx, w) = uw) dy. 


Now, for » on A 


Om; (x, y | 0. 
Hence 


Qm,; (2, y | = 0, Q(z, y | = 0 (u on A). 
Whence (3.30) will hold if 


(3.30b) lim x w) — hile, = 0, 
where 
(3.30¢) I(A) = +”) —A. 
Write 
a0) — =f +f) 
with 


where 1 ig sufficiently great so that the interval (—1, J) contains A. 
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For on A) 


1 1 


hence 1/(u — d) is continuous in » for zon I(l| 4). Thus by a theorem of Helly 
(3.31) tim — hi(z, = 0, 


inasmuch as h;(z, h(a, and 
Var. hj(z, S 


(a independent of j); 


the latter inequality being essentially a consequence of some of Carleman’s de- 
velopments. On the other hand, 


+ x u) — Aj(x, »)) 
(3.32) 


where 
d(x, 1) = lesser of — + 


We keep x (on (0, 1)) and A (on d) fixed. Choose / sufficiently great so that 


<5, 


which is possible by virtue of (3.32). Making use of (3.31) we then take j(€) 
so that 


(for all j = j(0). 
4) 2 
Thus, by (3.30d) 
[ w)| « (j 2 il). 
1(4) B 


Accordingly, it is observed that (3.30b) holds. The truth of Theorem 3.4 is 
now evident. 


4. Questions of uniqueness 


Results relating to uniqueness properties for kernels of Type 1.A, for A non- 
real, have been given by Carleman.™ We shall establish the corresponding 
results, for kernels of Types 1.A, 1.B, when A is in Sa, with Sa specified by 
Definition 2.1. 


24 (C; Chapter II). 
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We shall first prove the following theorem. 

TueorEeM 4.1. Let K(a, y) be a kernel of Type 1.B. Let X be in Sa. With 
(n;) designating a sequence (2.2), involved in the definition of the interval d, there 
exists a subsequence (m;) independent of f(x) so that, designating by gm,(x) the 
solution of 


1 
(41) => dy +502) (fez) La, 
we have 
(4.1a) lim ¢m;(x) = g(x) = Ti(f| x) 


for all f(x) C Le; moreover, g(x) will satisfy (1.1). 1 (f| x) will be a linear func- 
tional of f(x) such that 


1 1 
(4.2) ae = fla) |2) ae (A in S.), 
whenever fi(x), fo(x) C Le. 

For kernels of Type 1.A a similar result will hold for some subsequence (8;) (cf. 
(2.4)), with gm;(x) and Km;(x, y) replaced by (x) and (a, y), respectively. 


To demonstrate the above we confine ourselves to the Type 1.B and denote 
by i~n; the solutions C Le of 


1 

(4.3) = [ Kn y) dy + [fi(x) C Le; = 1, 2]. 
Then, as is known, 
(4.3a) de = ae. 
We have, with A in Sz, 

1 1 
In consequence of Lemma 2.1, for \ on d we may let 


(4.4a) Mx(A) = (1 + [ \fi(x) = qi (¢ = 1, 2) 


1 
w-h 


[A: = upper bound of |\| for A on d]. 


On the other hand, in view of (C), for \ non-real one may write 
2 pl 
(4.4b) Ma) = [ | fala) |? da = 1,2), 
B® Jo 


Where 8 is the imaginary part of A. Thus M,(d) is finite for every \ in Sa. 
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_ We have shown previously how one may select a subsequence (mj) of (n,) 
so that 


lim (2) = ¢(z) 


exists almost everywhere, C Le and constitutes a solution of (1.1). The sub- 
sequence (m;) will be selected independent of f.” Hence the relation (4. 1a) 


may be now asserted. It remains to establish (4.2). For the latter purpose we 


apply to (4.3a), where n; is replaced by m; , the result in connection with (3.5), 
(3.5a). This application is possible inasmuch as for \ in Sa we have the in- 
equalities (4.4), where the second members are finite (for every } fixed in S,). 
Hence passing to the limit and taking account of the already established for- 
mula (4.1a) one obtains (4.2) for every Xin Sa. 

Introducing certain slight modifications in the above developments, similar 
results are established for kernels of Type 1.A. 

From Theorem 4.1 it follows that if, for \ fixed in S,, the equation (1.1) has 
only one solution g(x) C Le, then g(x) = Ty(f| x) and (4.2) will hold for all 
filx), fo(x) C Le. 

TueoreM 4.2. Let K(x, y) be of Type 1.A or of Type 1.B. If ford = , fixed 
in Sa the only solution C Lz of the homogeneous equation 


(4.5) Wel) dy 


is zero,” the same will be true for all non-real values of 2. 

In view of Theorem II, (C; p. 55) it is known that if \; is non-real the above 
will certainly hold. Thus take \; on d. Suppose now that the Theorem is not 
true. There will then exist a non-real value \ and a corresponding function 
g(x) for which 


1 1 

(4.6) 0, oe) Ke, ay. 
0 0 

For this function we have 


(4.6a) g(x) — [ K(z, y)e(y) dy = (1 g(x). 


The equation 
(4.6b) oa) KG, DoW) dy = In) 


cannot have two distinct solutions since otherwise their difference w(z) will 
satisfy 


** The possibility of such a choice follows from a remark in the foot-note of (C; Pp. 56). 
*6 Almost everywhere, when K(z, y) is of Type 1.B; for « # &(» = 1, 2, -:*), when 
K(a, y) is of Type 1.A. 
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1 1 
(4.6c) [ K(z, y)w(y) dy = 0, [ | w(x) # 0, 


contrary to hypothesis. Thus (4.6b) has a unique solution C Lz ; hence in view 
of (4.6a) and of Theorem 4.1 the function g(x), involved in (4.6), satisfies the 
functional relation 


(4.7) g(x) = Th, (1 


On taking the conjugates of the members of (4.6a) it is deduced that 


(48) ate) — = (1 -™) av, 


so that in consequence of Theorem 4.1 and of the italics subsequent to (4.6c) 


(4.82) = Tr, (1 i2). 
Application of the same Theorem to (4.6a), (4.8) will yield 


[ (1 a(x) Tr, (1 |x) de 
Thus, by (4.7) and (4.8a) 


(4.9) Pde = (:-*) flee dz. 


By virtue of (4.6) from (4.9) it is inferred that \ must be real, which is contrary 
to hypothesis. The theorem is accordingly established. 

In (C) Carleman gave conditions under which kernels are of Class I; that is, 
possess the property that for some (and hence all) non-real values of \ the 
homogeneous equation has zero as the only solution C Lz. In view of Theorem 
4.2 the following may be asserted. 

In order that a kernel K(x, y) (of Type 1.A or of Type 1.B) be of Class 1 it is 
sufficient that for some value of d on d the only solution (C Lz) of the homogeneous 
equation should be zero. 

We shall now prove the following theorem. 

TurorEM 4.3, Let K(x, y) be of Type 1.A or of Type 1.B. The number m of 
distinct” solutions (€ Ls) of 


(4.10) g(t) =X [ K(z, y)ely) dy 


* ‘Distinct’, here and in the sequel signifies ‘linearly independent.’ 
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for any d fixed on d, is equal to or is greater than the number n of distinct solutions 


for non-real values of x. 
We first note that the number of distinct solutions is the same for all non-real 
values of \.” Suppose the theorem is not true. We then have n, n > m, 


distinct solutions ¢, (x), 
(4.11) = (y) dy =1,---n), 


for a non-real value \*. On the other hand, for a fixed on d, there exist by 
hypothesis m distinct solutions ¢;(z), 


(4.114) e(a) = Deity) dy (j= 1, m). 


Inasmuch as X and K(z, y) are real we may suppose that the ¢;(z) are real. 
Form a solution g*(x) for 


(4.12) g*(x) = Cy (x) 0); 
then 
(4.128) [ Pde #0. 


In addition, let the c, be so chosen that 
1 
(4.12b) [ ae = 0 


for all solutions g(x) satisfying (4.10) for the value X. The possibility of securing 
(4.12b) is inferred from the following considerations. One has 


g(x) = dj (x). 
Thus, (4.12b) will be satisfied if 


1 
¢*(x)p,(x) dx = 0 (j = 1, m). 
That is, we need to choose the c, (not all zero) so that 
n 1 
Lo ef dr = 0 (j = 1,2, m). 


Such a choice is possible since m < n. 
For the function (4.12) we have 


(4.13) y*(x) — [ K(x, y)e(y) dy =0, o*(x) — [ K(a, y)e*(y) dy = 9; 


28 (C; p. 58). | 
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these relations may be written in the form 


[ Ke, = (1- = se, 
(4.18a) 
or) KG, dy = (1 ane) = 


In view of Theorem 4.1 it is accordingly deduced that 
e*(x) = Ti(f |x) + 


= Tr(f| x) + 


where yi(x), ¥2(z) are certain solutions of (4.10) for 4. The operator 7)(h | 2) 
(\ on d) may be so determined that 


Ty(h| 2) = Tr(h| 2) (all Le); 


(4.14) 


we then have 


(4.148) = h(a). 
A further application of Theorem 4.1 will yield 


[ ae = [ 


that is, in consequence of (4.14) and (4.14a), 


By virtue of the statement with respect to (4.12b) 


1 1 
[ e@h@ar=0, =o. 
Whence 


). ( [ | o*(x) dx = ( > | p*(x) 


which by (4.12a) implies that AX* = X\*. Thus, A* must be real, contrary to the 
). — made in connection with (4.11).. Whence the theorem holds as 
Theorem 4.3 will still hold when the number of distinct solutions is infinite. 
If K(x, y) is of Type 1.C and if L,(é | h(x)) is a linear operator associated with 
K(z, y), in accordance with the conditions (1.9), --- (1.9d), theorems analogous 
0; to Theorems 4.1, 4.2, 4.3 will hold for the equation 


e(2)) — Le |K(e, wou) dy = Lele 


| 

| 

i 

iy 

ii 


608 W. J. TRIITZINSKY 


provided L,(é | h(x)) is real for h(x), C Le, real and provided 
L.(é | h(x)) = Le(€ | A(z). 


5. A further extension and examples 


Under certain circumstances solutions of the non-homogeneous equation may 
exist for \ real even if \ does not belong to any interval d of the description given 
in Definition 2.1. In this connection the following Lemma will be helpful. 

Lemma 5.1. Let K(x, y) be of Type 1.B. Designate by G a set 


(5.1) G = 


where (nj) forms an infinite subsequence of (n). Let T bea set on a finite part of 
the real axis in the d-plane, no point of T being coincident with a point of G.” 
With the equation (1.1) in view, write 


(n Ni3J,¥ 1, 2, ---), 


Form the series 
If 
(5.3a) | sn(A) | S s(A) (s(A) independent of n = nj 3m <m<---) 


where s(d) is finite for every in T, then 
(5.4) [ len(x) Pde (Ain 
with a(x) constituting a solution C Ls of (2.8) and | 
(5.4a) = + | az] 
To establish this Lemma we note that by (2.8a) 
(5.5) [ Pade = on a, 
where 


wa = [ decay. 


*° Tt may occur that every point of 7 is a limiting point of G. 

*° For n fixed the dn,» have no limiting points on the finite part of the axis of reals; } 
(in 7’) is distinct from \n,»(v = 1, 2, ---) and is not a limiting point of the An,»; hence > 18 
at a positive distance from the set (An,»)(» = 1, 2, --+ ; n fixed). Clearly, every series 
(5.3) will converge for ) in 7, inasmuch as | fn,: |? + | fa,2 |? + «++ converges. 
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In consequence of the first inequality (2.11) and of (5.3) 
1 


Whence by virtue of (5.5) 


which yields ! 
[ico {into +[f ise tae] 


ford in T andn = m,m,---. Finally, (5.4) will follow from (5.6) in conse- 
quence of the hypothesis (5.3a). This demonstrates the Lemma. 

With the aid of the above Lemma the following can be proved. 

THEorEM 5.1. The results stated in the previous sections for kernels of Type 1.B 
will hold (with obvious modifications) when intervals d (cf. Definition 2.1) are 
replaced by sets T (T' defined in Lemma 5.1) for which (5.3a) holds. 

In order to see the truth of this statement one needs only to repeat the de- 
velopments previously given, employing whenever necessary the inequality (5.4). 
Analogous facts will hold for kernels of Types 1.A and 1.B. 

In applications equations often occur in the form 


1 
= | K(@, dy +50), 
where K(x, y) is a kernel of one of the Types studied above. Our results could 
be applied whenever an interval d (Definition 2.1) or a set 7’, as described in 
Lemma 5.1, can be found so that \ = 1 belongs to d or to 7’, as the case 
may be. 

Under suitable conditions, many of the above results will hold when K(z, y) 
is approximated by regular (that is, belonging to Le in (a, y)) kernels, even 
when the modes of approximation are different from those previously used.. 

When the approximating kernels can be taken of positive type’ then the 
characteristic values are all positive and, clearly, any interval 


<0) 


could be used as d. 
A class of singular kernels may be constructed as follows. Let 


(5.7) (m, v = 1, 2,---) 


* A kernel A(z, y) is of positive type if Jf A(z, y)e(x)e(y) dx dy 2 0 for all g(x) C Le. 
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be real functions C L for x on (0, 1) and such that 


1 
(5.7a) de = 0 
1 
(5.7b) [ =1 
0 
1 
(5.7c) de = (mx 
Suppose real numbers \»,, are such that the series 
1 2 Vin, (2) 
converge (for almost all z on (0, 1)), while the series 
(5.8a) 8. 
diverges. 
Form the functions g»,(z, y), 
(5.9) n(x, y) = 
The integrals 
(5.9a) [fo dzdy = 8 


will exist. The gm(z, y) will be regular kernels; moreover, they will form an 
orthogonal set. The kernels . 


(5.9b) Kw (2, y) = gi(z, y) gn(x, y) (n = 1,2,-- 
will also be regular; in fact, the integrals 


exist. By (5.7a), (5.7b) 

[ way = DF 
Thus 


(5.9d) [ y) dy = gn(x, y) dy = 2). 


% Examples of such functions are easily given. 
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The kernel 
(5.10) K(x, y) = y) = lim y) 
will be singular of Type 1.B. The characteristic values for the approximating 
kernel K,(x, y) will be the numbers 
Amy» (m= 1,---n;vy = 1,2,---); 


accordingly, these values will be at our disposal, subject to the general condi- 


tions stated above. 
We shall obtain a kernel which is merely measurable by dropping the require- 
ment that the double series in (5.8) should converge. 


6. Equations of the first kind 
The study of integral equations of the first kind 


(6.1) [ KG, We) dy = [real f(z) Las on (0, 1)] 


where K(x, y) is of Type 1.A or 1.B, is reducible to that of another integral 
equation (also of the first kind but with a kernel which is not necessarily sym- 
metric), to which the classical methods of Picard and Lauricella are applicable.” 
On the other hand, when K(z, y) is of Type 1.C the integral 


[ Kena 


does not need to exist; consequently the reduction, referred to above, will be in 
general impossible for kernels of Type 1.C. Accordingly, in the sequel it will 
be supposed that K(x, y) is of Type 1.C; moreover, in place of (6.1) we shall 
study the related functional equation 


(62) Lal | K(a, y)oly) dy = |f(2)) (f(a) La), 


where L.(&| h(x)) is an operator [associated] with K(x, y), subject to the condi- 
lions (1.9), --- (1.9d). It is conceivable that in some cases 


(63) C1, (in &), LAE | f(z)) Cl, (in &); 


this would enable reduction of (6.2) to an equation to which the methods of 
Picard and Lauricella are applicable. However, (6.3) will hold not always. 
Accordingly, development of a different procedure for the study of (6.2) is 
desirable. 


* References to the papers of these authors, as well as certain other facts concerning 
equations of the first kind are found in J. Soula, L’équation intégrale de premitre espece a 
limites fixes et les fonctions permutables a limites fixes [Mémorial des Sciences Math., 80 
(1936)]. For the device referred to in the text see pp. 3-4. 
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Let K,(z, y) be the kernel defined in (1.4). This kernel will be regular and 
will possess characteristic values A,,, and characteristic functions ¢,,,(x). In 
terms of these numbers and functions we form the spectrum 6,(x, y | \) (ef. the 
text subsequent to (3.17)) of K,(2, y). While for kernels of Type 1.B it is 
possible to select a subsequence (6,;(x, y | A)) converging to a limiting function 
(x, y |X), termed spectrum of K(z, y), for kernels of Type 1.C existence of a 
spectrum cannot be asserted.” On writing, as before, 


(6.4) | y|A) = 6n(x, y |r) dx dy 


and on recalling that there exists a subsequence (n;) of (n) for which the limit 


(6.4a) lim y|) = y |) 


exists for almost all x, y in the square 
1 


‘and for all real values except perhaps for = p2, (for which values 
Q may be discontinuous in \), it is also to be noted that 


(6.4b) Var. A(x, y|d) < (ay)', Az, y|0) = 0; 


In accordance with the terminology of Carleman it will be said that © is ‘closed’, 
if the generalized Bessel’s inequality (which holds for kernels of Type 1.C), 


1 ar a ) 1 
(6.5) dh [ h(a) (2 iy) dy da, 
is valid with the equality sign (for all h(z) C Ls); one then has 


(6.5a) h(a) = a y dy 


for almost all x on (0, 1). 

We shall establish the following theorem. 

THEOREM 6.1. Let K(x, y) be of Type 1.C, with K,(a, y) designating the kernel 
(1.4), the characteristic numbers and functions of K,(x, y) being the dn (v = 
1, 2, ---) and the yn,,(x) (v = 1, 2, ---), respectively. 

Suppose that for a subsequence (n;) of (n) each sequence 


(6.6) Ong 22), 
forms a complete set and that the series 


* Cf. (C), where kernels less general than those of Type 1.C are treated in Chapter IV. 
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converge (forj = 1, 2, +--+) and 
(6.6b) A (finite A, independent of j). 
Then the equation (6.2) has a solution g(y) C Le. 


In order to establish this result we note that by the classical theory the condi- 
tions stated in connection with (6.6) and (6.6a) enable us to assert that 


the equation 

‘ 1 
(6. Welw) dy = S02) 
has a solution gn;(y) C Le. Clearly 


6.8) [ | Kaz, y))en(y) dy = Le(E |f(2)) (n = 


and, inasmuch as the v Fourier-coefficient of ;(y) 18 An; nj, by Parseval’s 
relation we obtain 


1 
(6.8a) [ ¢nj(y) dy =Tn;- 
Thus, in view of (6.6b) 
(6.9) [ s (j= 


In consequence of a known theorem (ef. (3.1), --- , (3.1b)) it is observed that 
(6.9) implies that for a suitable choice of the subsequence (n;) there exists a 
function g(y) C Le such that 


(6.10) n;(y) > oy) (weakly),’ 
while 

1 
(6.10a) [ ¢ (y) dy < A. 


On taking account of (1.9a), (1.9b), of (6.9) and of (6.10), it is concluded that 
the relation 


lim | Kn; (x, y)) en, (y) dy = [ LAé | y))e(y) dy = | f(x)) 


is valid by virtue of the known result, stated in connection with (3.5), (3.5a). 
This establishes Theorem 6.1. 

With (6.4) in view and recalling the definition of @,(x, y | \), it is deduced that 
a series (6.6a), if convergent, is representable by the convergent Stieltjes integral 


(6.11) [ “fla) (2 [ Sv) dy) ar 
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On writing 
(6.11a) 1,,(1) = va f(z) (2 f fy) dy) de 
it is deduced that 


(6.11b) > (as 1 «) 


monotonically, inasmuch as the terms of the series (6.6a) are all non-negative. 
If (6.6b) holds we have 


(6.11¢) A. 


Choose the subsequence (n;) so that the limit (6.4a) exists. It will be convenient 
to express I’,,;(1) as follows: 


whe 


(6.12) 

%) = 2 [ fey) ay. 
Now 
(6.122) [ Paz s az 


and, hence, the n; can be so selected that 
(6.12b) > q(z, 4) = = Sly) ay y |) dy 
in the weak sense with respect to z;” 
1 1 
(6.12¢) [ [ 


In view of (6.12a) and (6.12b) application of [(3.5), (3.5a)] is possible so as to 
obtain 


(6.13) lim p;(A) = p(d) = f(x)q(x, d) dx (cf. (6.12b)). 


We also have 
(6.14) | p,(a) |, |, Var. p(A) S [ 


In consequence of (6.13) and (6.14) an application of a theorem of Helly will 
result in the relation, valid when (6.6b) holds, 


% Cf., for instance, Trjitzinsky, loc. cit.; [(3.9), (3.9a)]; the latter relations will hold 
for the more general kernels of the present paper. , 
% This is a consequence of certain formulas in (C), extended by Triitzinsky, loc. cit. 


[(3.11)]. 
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(6.15) Him = lim = [ ((6.13), (6.12), 


provided the n; are suitably chosen. 
By (6.11¢) and (6.15) 


(6.16) = < A, 


where A is independent of 1. Now, in view of the manner in which I(/) has 
been obtained it follows that ['(l) is a monotone non-decreasing function of 
l,asl— «©. Hence, by virtue of (6.16) it is deduced that 


lim =r = [ = Bs 4. 
Thus, if (6.6b) holds then necessarily the integral 


exists and T < A, provided that the subsequence (nj) involved in the definition of 
Q(x, y | \)(6.4a) ts suitably chosen. 

The question as to whether there is a converse to this result is of considerable 
interest, but is at this time left open. 

Suppose now merely that the statement with respect to (6.6) holds. With! > 0 
we have 


(6.18) 


+ [ (2 [ hia) dy) = R(j,)) 


for all h(x) C Le. In fact, (6.18) follows from 
[ Ke Pn; Wy) en; ay | ax 
1 


where the first summation is over the values v for which | \,;,| < J, while the 
second summation is over the other values v. By Parceval’s relation 


R(j, 0) = [ dy [ h(x) (2 [ h(y) Qn y |r) ay) de 


(6.18a) 


(6.19) 
= [ h?(x) dz. 


Now 


(6.19a) Rj, 0) = A(j, 1) + RG, D, 
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where 
[aw 
(6.19b) A(j, 1) = [ a x ay y | ) dy) ae. 
We shall have 
l 1 1 
0 
6.20) lim AG,) = AD = [ (2 [ ae 
Jo ax Jo dy 
for a suitable subsequence (n;); this is established with the aid of a reasoning 
of the type used in proving (6.15). 
By (6.18), if one writes 
d [ [ 0 
(6.21) | 1) = A(x) h(y) ay | d) dy, 
in consequence of (6.19) it is inferred that 


(6.22) = de 0) = W(x) de. 


In view of (6.22) and of [(3.1), --- (3.1b)] it is noted that the subsequence 
(n;) can be so chosen that there exists a function o(z | l) for which 


(6.22a) lim o,(x | 1) = o(x | I) (weakly) ; 
by (6.22) and (6.22a) one also will have 
(6.22b) lim [ ax = az ar, 


which is a consequence of [(3.5), (3.5a)]. 
By virtue of (6.22) and (6.22b) 


(6.23) lim R(j, 1) = [ o°(a | 1) dz. 
Moreover, by (6.22a) and (6.21), for 7 — 
x l 1 a 
dx = h(x) de — [ a, [ iy) dy 


— |D de = Fe|D, 


where F(z | 1) is absolutely continuous in z. By a reasoning of the type em- 
ployed before it is concluded that 


l 1 l 1 
2. 4 
Thus, for almost all zx on (0, 1) 


(6.28a) o(z|l) = F@ |l) = h(x) — [ dy [ h(y) y |d) dy 
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and, in view of (6.23), 
(6.24) lim RG,) = (x) — [ dy h(y) ay dx. 
Consequently from (6.19a), (6.20), (6.24) and (6.19) we infer that 


lim R(j, 0) = dy [ (2 [ h(y) 5 ay) de 


With the aid of certain results, which can be extended to our kernels,” it is 
observed that for the function (6.23a) we have 


(6.26) o(x |) o(2) (weakly in 23h &), 
where o(x) satisfies 

(6.26a) KG, oly) ay = 0. 

Clearly 

(6.26b) o(x|1,) dx < W(x) de. 


Turning to (6.25) it is noted that, inasmuch as the first integral displayed is 
monotone non-decreasing, as 1 — ©, both integrals of the second members of 
(6.25) possess unique limits. Accordingly, put | in (6.25) equal to l, and let 
v— ©. In view of (6.26), of (6.26b) and of [(3.5, (3.5a)] 


lim [ o(x|l,) dx = [ o (x) dz. 


Thus, from (6.25) it is deduced that 


h(x) (2 [ h(y) y iv) da 


(6.27) 
+ [ oe) dz = ae. 


Since (x) satisfies (6.26a), from (6.27) it is inferred that the following result 
will hold. 


If for a subsequence (nj) of (n) each sequence of functions (6.6) forms a comple e 
set and if the only solution < Le of the equation 


(6.28) | y))e(y) dy = 0 


*'Cf., for instance, Trjitzinsky, loc. cit. [(3.23), (8.23c)]. 
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is g(y) = 0 (almost everywhere) then there exists a closed function Q(x, y | d) 
(cf. (6.4a) and the statement in connection with (6.5)). 

It will be convenient to introduce the following definition. 

DeFINITION 6.1. An operator L,(é | h(x)) will be said to be closed if the relation 


(6.29) L(g | h(x)) = 0 (h(x) I) 
implies h(x) = 0 (almost everywhere on (0, 1)). The equation 


6.30) Lilt | K(x, y) dy = 0 


will be said to be closed if the only solution & Le of (6.30) is ¥(y) = 0 (almost every- 
where on (0, 1). 

Suppose now that the equation (6.30) is closed, according to Definition (6.1). 
With this hypothesis in view suppose, if possible, that there exists an infinite 
subsequence (m,) of (n) such that for m, m2, --- the following holds. 

The equation 


(6.31) | Wy) dy = 0 


has a solution ¥;(y) C Le, which is distinct from zero; that is, 
¥ily) dy 0. 
Multiplying by suitable constants we arrange to have 


1 1 
(6.314) Lilt | Knj(x, y) Wy) dy = 0, [ Vy) dy = 1 


forj = 1, 2,---. On taking account of [(3.1), --- (3.1b)] the m; could be so 
chosen that there exists a function ¥(y) such that 


(6.31b) lim y;(y) = ¥(y) (in the weak sense). 


Furthermore, by [(3.5), (3.5a)] we shall have 
1 1 
(6310) tim [ ay = [ dy = 1, 
7 0 


corresponding to the aforesaid sequence (m,). 
Now, in view of (1.9a) and (1.9b), together with the second relation (6.31a) 
and (6.31b), application of [(3.5), (3.5a)] is possible, yielding the result 


(6.32) lim | Km,(x, y) Wily) dy = Lg | K(z, y))¥(y) dy = 0, 


where ¥(y) satisfies (6.31¢). The equation (6.30) is accordingly not closed, 
contrary to hypothesis. Hence there exists no infinite subsequence (m,;) of (n) 
for which (6.31) has a solution ¥;(y) C Le, distinct from zero (j = 1, 2, --:)- 
Thus, the following has been established. 
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If the equation (6.30) is closed (Definition 6.1) then the equations 
1 
(6.33) [ | Kn (2, y) dy = 0 


will be all closed, except at most for a finite number of values n; say, 


n(1), n(2), n(9). 
Suppose (6.30) is closed; consider an equation (6.33) (for n # n(1), --- n(q)); 


the equation 
(6.34) [ Kate, WwW) dy = 0 


will have no solution C Le, distinct from zero. In fact, in the contrary case let 
Ww(y) C Le be a solution of (6.34) distinct from zero. Then, by (1.9d) 


1 
[ Lil | Ka(x, y) dy = 0. 


This, however, contradicts the fact that the equation (6.33) is closed (for 
n # n(1),--- n(q)). Thus, with (6.30) closed, every relation 


(6.34a) K,.(2, y)W(y) dy = 0 (W(y) C Le; n ¥ n(1), n(q)) 


will imply that ¥(y) = 0 (almost everywhere). 
. Hence one may replace the result in connection with (6.28) by the following 
more complete result. 

If the equation (6.30) ts closed (Definition 6.1) then the infinite sequences (6.6) 
(with n; # n(1), --- n(q); ef. (6.33)) are all complete and there exists at least 
one closed function Q(x, y | d). ; 

We accordingly have the following theorem. 

THEOREM 6.2. The conclusions of Theorem 6.1 will hold if the condition stated 
in connection with (6.6) ts replaced by the requirement that the equation 


[ | y) Wy) dy = 0 


be closed in the sense of Definition 6.1. 
When the operator L is closed (Definition 6.1) we obtain additional informa- 
tion regarding solubility of the equation 


1 
(6.35) [ ay = 
itself. In fact, when L is closed and when the relation 


is valid for all g(y) © Le, every solution g(y) C Ls of the functional equation 
(6.2) will be a solution of the integral equation (6.35). 
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ON BIANCHI’S PERMUTABILITY THEOREM AND THE THEORY 
OF W-CONGRUENCES 


By Guipo FvuBINI 
(Received May 23, 1939) 


1. In the theory of the transformations of many kinds of surfaces and in the 
theory of rectilinear W-congruences a fundamental réle is played by Bianchi’s 
permutability theorem. The conviction that it is possible to state a converse 
theorem has led me to the following calculations and results. In this paper 
I state also a problem which is not completely solved. We shall say that two 
surfaces are congruence-transforms of each other if the points of the two surfaces 
are in a one-to-one reciprocal correspondence and the two surfaces are the focal 
sheets of the congruence generated by the lines joining two corresponding 
points. 

Let the five points x, y, z, t, » be functions of two parameters u, v (that is, 
their projective homogeneous coérdinates yi, 2:, ti, = 1, 2, 3, 4] 
are functions of u,v). As u,v vary, these points ordinarily generate five surfaces 
2, We suppose that each of the surfaces 2, , Ly is a con- 
gruence transform of every one of the surfaces 2,, Z;, Z_,. It may happen that 
the curved lines enveloped on =, by the tangents (xz), (xt), (xn)* correspond to 
the lines enveloped on 2, by the lines (yz), (yt), (yn). In this case every line 
tangent to the surface 2, intersects the corresponding line tangent to 2, : the 
lines (xy) joining corresponding points of 2. , 2, will pass through a fixed point 
w (whose codrdinates w; are constant). By changing the factors of propor- 
tionality we can suppose 
(1) or (w; = const.;7 = 1, 2, 3, 4). 
We can now change the curvilinear coérdinates, and suppose that the tangents 
(x, z) and (y, z) envelop the lines v = const. on 2, , 2, and that the lines (2, ¢) 
and (yt) envelop the lines u = const. Then the tangents (x, z) and (y, z) are 
identical with the tangents (2, x,,) and (y, y,); and their intersection z is identical 
with the point 


‘ Corresponding points on these surfaces are the points defined by the same values 
of may not be a congruence transform of Sy. 

* By (zz) or, by (2, z), for instance, we indicate the line joining the points z and 2; this 
line touches the surfaces 2, and =, at these points. 
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In the same way we prove that we can suppose 


(3) t= ty = 

The line through z and ¢ also passes through 7; therefore we can find a function 
p(u, v) such that 

(4) n= Z+ pt = Lu t+ ply = Yu + pW. 

From our hypothesis it follows that the plane tangent at z to the surface =, , 
the plane tangent at ¢ to 2,, the plane tangent at 7 to the surface =, all pass 
through x and y, that is, through z and w. Therefore z, , 2, must be linear 
combinations of z, x, w; t, , ty linear combinations of t, x, w; and nu, mn» linear 
combinations of 9, z,w. We can find eighteen functions a, b,c, A, B,C, p,q, --- 
such that 


+br pt trv, m=lh + mer 
z=Az+ Be+Cu, tt = Pt+Qr+ Rv, »=Lln+ Mz+Nv. 
By use of (4) we deduce: 4 


(5) 


(6) Pu=aQa-—p, p=A-—P; 
and therefore (a — p)» = (A — P),. From the identities z, = Tw = ty we | 
deduce: 


From the identities (2.)» = (Zv)u and (tu)» = (tv). we deduce: 
b=0, B=a, Q=0, Cu.=a+aC, r, = Ry + 7P, 
and from (6) we obtain 
™m™=C+ar, = + 
The equations (5) become: 
Luu = % = az + cw = ar, + cw 
= % = rw 
lie = t, = = Px, + Ru, 


where a is a function only of u, and P is a function of v. We can change the 
parameters of the codrdinate lines u = const. and v = const. in such a way 
that a = P = 0; and the foregoing equations become 


Luu = Cw, Luv = TU, Zo = Rw. 


From the conditions of integrability it follows that we can find a function ¢ 
such that 


C = Guu; r= R = Ow. 
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Therefore the foregoing equations become: Sir 
— PW)uu = — = — = 0. a 
Thence 
(7) ri = gw; + au + bw + ¢; (w; , a; , , = const.) 
3 or 
(7’) +e, 


where w, a, b, ¢ are fixed points. From (6) we get p = const., and, if we put 
n = z + pt, where p is an arbitrary constant, we find that all the «' surfaces 3, 


(x, w), (tu, w), (@», w). Hence: the congruences generated by the lines (zx, n) 

are all W-congruences; in the same way we demonstrate that also the lines (y, n) 

generate W congruences. In this case we have therefore completely demonstrated 

hed): the converse of Bianchi’s permutability theorem. And in this case we remark: an 
Not only 2, and are congruence-transforms of all the surfaces 2, , but also 

y ‘ the ~' surfaces generated by the points x + ow (where o is an arbitrary constant) 

are congruence-transforms of all the surfaces =, . 


(and only these surfaces) are congruence-transforms of both of the surfaces 2, , 2, . Fr 
We indicate, as usual, by (2, 7), (z, mu) --- also the codrdinates of the line 60 
joining the points and 2, of the line joining z and , --- . From (z, 7), = 
(tu, + (2, mu), , we deduce: 
(2, m)u = p(tu, Tv) + + peur) Ww), (w 
(x, = (ae, tu) + + w), 
It is easy to see that (x, (x, (z, (x, ’ (x, (x, are not tra 
Bit linearly independent, because they are linear combinations of (z, 7), (2, 2»), cul 


RE . 2. Let us now suppose that only two of the tangents (z, z), (z, t), (x, n) are Be 
Ae | ; the corresponding tangents of (y, z), (y, t), (y, 7): for instance we suppose that bre 
(x, z) and (y, z) [(z, ¢) and (y, t)] are corresponding tangents, but that (z, 7) ob 
does not correspond to (y, n). Let us change the parameters u, v so that the wh 
lines v = const. are the lines enveloped on 2, and 2, by the tangents (z, 2) no’ 
and (y, z) and that the lines wu = const. are the lines enveloped by (z, ¢) and Th 
(y, t). We can find twenty functions a, b, ..- of u, v such that col 
(8) Ly = ax + bz Yu. = Ay + Bz ‘ 
(9) %=pr+q y= Py+Qt cor 
(10) = he +ke +ly tu = ct +ex + fy 
2, = Hz + Kx + Ly tp = Ct + Ex + Fy. 
3 (12 
Obviously 
(11) b¥0, Q#0. 
* Tf, for instance, b = 0, then z, = az; and the point z could not generate a surface. (1 


if 
‘ j 


ON BIANCHI’S PERMUTABILITY THEOREM 623 


Since the point » belongs to the line passing through z and ¢, we can write 
(12) n=z-+ ol, 


where p is a function of u,v. Also », and 7, must be linear combinations of 4, 
z,y. If we calculate y, and 4, , we deduce by means of (10): 


Pu —¢, 
p 


(h — c), = (H — 


From the last equations it follows that we can multiply the homogeneous 
coérdinates of z by the same factor, and the coédrdinates of ¢ by another factor, 
so that 


h=c, H=(C, 


(we could also make h and c equal to zero). Therefore p = const. 

If p is an arbitrary constant, y, and 7, are always linear combinations of 7, 
z, y; and all the ©’ surfaces (depending only on one constant p) are congruence- 
transforms of 2, and Z,. The lines (xz, n) = (x, z + pt) envelop on =, the 
curved lines defined by the equation 


and the lines (y, 7) envelop on 2, the curves 
du dv 


Because of our hypothesis, these curves do not correspond; and therefore 
b:q B:Q. If we give to p two values p and (p p, 0, ©) we 
obtain two points 7 and m , which generate two surfaces 2, and Z,, , both of 
which are congruence-transforms of 2, and 2,. And the tangent (zx, 7) does 
not correspond to (y, 7); the tangent (zx, m) does not correspond to (y, m). 
Therefore by changing 7 and m into z, t, we can suppose that (z, z) does not 
correspond to (y, z) and (a, t) does not correspond to (y, ¢). 


3. Let us now suppose that the tangents (z, z) to S: and (yz) to S, are not 
corresponding tangents. We can change parameters so that the lines (z, z) 
envelop on 2, the lines v = const., and the lines (y, z) envelop on 2, the lines 
u= const. Therefore there exist four functions a, b, A, B of u, v, such that 


(13) Yu = ax + bz, Yo = Ay + Bz. 


Since the plane tangent at z to 2, is the plane of the points x, y, z, we can find 
six functions k, s, w, --- so that 


(14) Zu = kz + sx + wy, zy = Kz + Wa + Sy. 


| 
“9 
| 
| 
‘ | 
), 
) du dv 
d q-P 
0 
— 
‘ 
it 
d 
\ 
| 


624 GUIDO FUBINI 


The tangent plane at x to 2, [at y to 2,] is the plane of the points 2, z, ¢ [of the 
points y, z, t]. Therefore 2, is a linear combination of z, z, t, and y, of y, z, t. 
We can write: 


(13’) ty = cx + fe + gl, Yu = Cy + Fz + Gt, 


where c, f --- are new functions of u, v. 
We can also find for ¢ two equations which are analogous to the equations 
(14) for z: 


(14’) t, = ht + mx + ny, tp = Ht + Na + My. 
Instead of (13’) we can write the equations 
(13”) ba, — fr. = (be — af)x + bgt; By. — Fy = (BC — AF)y + BGT. 


If also the tangents (xt) and (yt) are not corresponding tangents, the difference 
bB — fF is not equal to zero; and we can choose the factors 1:p and 1:4 so that 
_ 1Fdu + Bdv _ 1fdv + bdu 
bB—fF ’ bB — fF 


(whence du = —fpdU + BaodV; dv = bpdU — FadV) become exact differ- 
entials. We can substitute for the equations (13’) the equivalent equations 
(13’’), which we have now transformed into 


"aV 


and these equations are completely analogous to the (13). Therefore there is 
no substantial difference between the points z and t, or bétween the equations 
(13), (13’). Also in this case we can write: 


n= 2+ pl; 


and the ., 7» must be linear combinations of n, z, y. We deduce 


_ 


(13’”) p(bc — af) x + pbgt = o(BC — AF)y + oBGt, 


and we can multiply the homogeneous codrdinates of ¢ and z by such factors 
that 


(15) <K=4dH; k=h=0. 


Then p = const.; and, by giving to p an arbitrary constant value, we obtain 00! 
surfaces 2, which are all congruence transforms of =; , Dy ; and conversely, each 
surface, congruence transform of 2, and Z, , is one of the preceding surfaces 
z,. From (13), (14), (13’), (14’), we can deduce the values of the derivatives 
Of Tu, Tv, Yuy Yo Zu, 2v, tu, ty as linear combinations of x, y, z, t. The condi- 
tions of integrability 
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must be transformed as identities, because the points x, y, z, ¢ are linearly 
independent. Let us now compare the coefficients of y in the first of these 
equations, the coefficients of x in the second, the coefficients of ¢ in the third, 
and the coefficients of z in the fourth. We get, from (15): 


(16) bS = fw + gn 

(16’) Bs = FW + GN 
(16”) sg = SG 

(16””) mf + nB = MF + Nb. 


By comparing the coefficients of z (of ¢) in the third (the fourth) of the pre- 
ceding identities, we have: 


sf +wB = SF+Wb+H,, m=MG+4H., 
whence, by eliminating H, : 
(17) j = bW — fs — GM = Bw — FS — gm. 


(We have called j the value of the second and third members.) From these 
identities it follows: 


If one of the congruences generated by one of the lines (a, z), (x, t), (x, n), (y, 2), 
(y, t), (y, n) ts a W-congruence, these congruences are all W-congruences (and we 
have precisely the case of Bianchi’s permutability theorem). 

In order to demonstrate this proposition, we begin by remarking that, by 
changing notations, we can indicate by 2. , 2; two surfaces arbitrarily chosen 
among the «' surfaces z,- Therefore it is sufficient to prove: 

If the congruence generated by (x, z) is W, the three congruences generated by 
one of the lines (x, t), (y, 2), (y, t) are also W-congruences. The congruence 
K;, = K., generated by the lines (z, z) is W if and only if the codrdinates (z, z) 
of the lines (x, z) and their derivatives of first and of second order are not 
linearly independent. By means of the preceding equations we find that all 
these derivatives are linear combinations of the codrdinates of the six lines 
(z, z), (y, z), (a, t), (y, 8), (x, y), (t, z). The congruence K;, is W if and only 
: the determinant A,, of the coefficients of these combinations is equal to zero. 

e find 


= (x, 2) = (tu, 2) + (2, 2u) = a(x, z) + w(z, y), 


(x, 2)» = (c + H)(z, z) + g(t, z) + S(z, y). 
Disregarding the terms in (z, z), (2, y), (t, 2), we find also 
(2, = wlb(z, y) + G(z, +---, 
(, = wib(z, y) + g(t, + 
(x, Z)ev = (Sf — Mg)(zy) + 2Sg(t, y) — 
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Therefore 
1 0 0 0. 0 0 
a w 0 0 0 0 
_\|c+H S g 0 0 0 
we 
uf 0 wg 
Sf— Mg —gW 2g 
b G 0 b G 0 
=wg| f 0 gi=wy| f g 
Sf— Mg -—gW 28g -W 


= gw'(bgW — GfS — gGM). 

By means of (16’’) and (17) we deduce: 
(18) +An = 
By analogous calculations we find 

(y, Zu = Cly, z) + Gt, z) + s(y, 2), 

(y, 2)» = (A + A)(y, z) + Wy, 2), 
and, by disregarding the terms in (y, z), (y, x), (t, 2): 

(y2)uu = (Gm — sF)(x, z) + 2Gs(t, + Gu(t,y) +---, 


(y2Z)uv = G[N (x, z) + Wit, x) + S(t, y)] s{g(t, y) + Bia, z))+---, 


(y2) vv —Wig(t, y) + z)] + 


and we find immediately that the determinant A,-. of the coefficients of these 


linear combinations is given by 


Gw Gm-— sF 
S N W 
g B 0 


= GW’ g(GmW — 2sGN — sFW) — B(GwW — 2s8S8@) 
= GW’ (gmW — 2GSN — SFW — BwW + 2sSB) * 
= GW* (gm + SF — Bw), * 


Ay: = ¢ Ww 


and therefore 
(18’) +tAy = 


* By means of (16’’). 
5 By means of (16’). 
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Let us turn now to (zt). We find 
(x, t)u = a(x, t) + b(z, t) + n(z, y), 
(x, th» = (c + H)(2t) + f(z, t) + M(z, y), 
and, by disregarding the terms in (z, t), (2, #), (x, y): 
(z, tau = b{ —m(x, 2) — 2) — w(t, y)} 
+ y) + F(a,2}+---, 
(1, = b{ -N(a, z) — M(y, z) — y)} 
+ n{ Bix, 2z) 
(2, = f{-N(a, 2) — z) — S(t, y)} 
+ z) + fey) 


Therefore the determinant of the coefficients of these linear combinations is 


given by 
1 0 0 mn w F 0 
c+H f M ¢ 
In the right-hand member —n is multiplied by 
F -—-b 0 
N M_ S8j\=g9(FM + dN) + FfS — B(bS) 


= g(mf + nB) + FfS — B(fw + gn) 


(by means of (16’””) and (16). Therefore —n is multiplied by —fj. The coeffi- 
cient of b is 


mn w 
NM 8 


|B -f g 


= M(mg — wB + SF) + S(nB — MF + mf) — nNg — wNf 


= —Mj + N(Sb ng — wf) = 
(by means of (16’”) and (16)). Therefore 
(18”) = (bM — nf)'j. 
In the same way we find: 
(y, Yu = Cly, t) + + mty, 2), 
(y, t)» = (A + A)(y, t) + Bez, t) + Ny, 2), 


| 


‘iil 


628 GUIDO FUBINI 


and, by disregarding the terms in (y, ¢), (z, 4), (y, 2): 
(Y, Yuu = —F[m(x,z) + nly, z) + s(t, x)] 

— mlb(z, y) + F(z, 2) + G(x, +-.., 
(y, Quv = z) + M(y, 2) + Wt, x)] — mif(z, y) + Biz, +.---, 
(y, = —BIN(a, z) + 2) + Wit, x)] — y) + Biz, +---. 


Therefore the determinants A,, of the coefficients is given by 


1 0 0 b F G —n Mm —s | 
+tAy=! C F N —-W\-F\-M N 
A+H BN f B 
= (FN — mB)*|—bWB + f(FW + NG) + GMB\m 
+ [nBW + fmW — sfN — M(sB)|F 
= (FN — mB)’ {mB(—bW + fs + GM) 
+ F(nBW + fmW — sfN — M[FW + GN))} ° 
= (FN — mB)’ {(mB — FN)(—bW + fs + GM) 
+ W(—bFN + FnB + Ffm — MF’} 
= (FN — mB)*(oW — fs — GM). 
Therefore 
= (FN — mB)*j. 


Let us now remark that 

A) wy # 0. For, if g = 0, the tangents (z, x.) and (z, x») would coincide. 
If w = 0, 244 would be a linear combination of z, z,. And in this case the 
lines v = const. on 2, would be straight lines, and their tangents (x, 2.) would 
not generate a congruence. 

B) WG # 0, because the tangents (y, y,) and (y, y») cannot coincide, and 

because yy, cannot be a linear combination of y, y . 
C) bM — nf # 0. Together with (13’”’) we can write 


If bM — nf = 0, the derivatives d°x/8U” would be a linear combination of 2 


and d2/dU. 
D) FN — mB # 0. Together with (13’”) we can write 


+b = p{H(b — f)t + (ON — fm)x + — fn)y}. 


rt) = o{H(B — F)t + (Bm — FN)x + (Bn — FM)y}. 


° By means of (16’). 
7 By means of (16’’’). 
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If FN — mB = 0, the derivative d°y/dV" would be a linear combination of y 

and dy/dV. 
From these remarks and from (18), (18’), (18’’), (18’’”’) it follows that, if one 
of the determinants Azz , Aye , Azt , Ay: iS equal to zero, then j = 0, and all these 
determinants are equal to zero; which demonstrates precisely the stated theorem. 
But this theorem does not prove that always j = 0, i.e. that the congruences are 
all W-congruences. We must now remember that in Bianchi’s theorem there 
are two systems of ~' surfaces, and that every surface of a system is a congruence- 
transform of every surface of the other system. In this case we can complete the 
preceding result, and demonstrate precisely that all congruences are W-congruences. 
We must suppose we can find a function o(u, v) such that the surface generated 
hy the point « + oy will be congruence-transforms of 2,, 2. This is true 
only if d(z + oy) is a linear combination of x + cy, z,t. But 


d(x + oy) = x(adu + cdv) +o| Cd + Adv + oes, 


where we have not written the terms in z or ¢. Our condition will be satisfied 
if and only if 

adu +cdv = Cdu+ Ado 
that is, if (C — a) du + (A — c) dv is an exact differential (and o will depend 
only on one arbitrary constant). We can therefore find o, if and only if 


a(A—c) _ aC —a) 


In the condition of integrability d2,/dv = dz,/du, let us write the term in 2; 
and in the identity dy,/dv = dy,/du, let us write the term in y. We find 


da 8c | aC 0A _ 
+ gm — bw, FS — GM. 


These two derivatives will be equal to each other if and only if 
fs+gm — bW = Bw — FS — GM or fs + GM — bW = Bw — FS — gm. 


By comparing with the identity (17) we get 7 = 0; which proves the theorem 
quoted. 


4. We solve now another problem. When does it happen that two surfaces 
2y, congruence transforms of an identical surface 2, , are also congruence- 
ransforms of at least two other surfaces =, , and consequently of ~' surfaces 2, ? 

After changing notations, let us suppose that wu, v are asymptotic parameters 
on 2, and that z are projective non-homogeneous coérdinates. It is known‘ 


that we can determine seven functions 6, B, y, A, B, A’, B’ of u, v such that 


* Fubini e Cech, Geometria proiettivo differenziale (Bologna, Zanichelli), vol. I, pp. 89, 
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Zuu = + Bry = Ovty + Y2u 
= pz + 2(Az, + Bey) y = wz + 2(A’z, + 
= — Boo) n= NE + — BS), 


where ¢, &, 7 are the coordinates of the planes tangent at z, z, y to the surfaces 
22, Z,, and where 


w= AX +A + (3, - + + Br), 


A, + 1) 
A 


—A, — + A AP + + Bo, B 
and uy’, \’ are given by the analogous equations, obtained by changing A, B, 
w into A’, B’, 

The points ¢ lie on the intersection of the planes £, 7, which is the straight line 
joining the points z and 


+ — dy) [4 = A’B — 
| Therefore we can write 


t = Rz + + — dy). 


{ t We must determine FR so that the plane tangent at ¢ to sw shall be the plane 
r ny Mihi passing through ¢, x, y; we require also that dt be a linear combination of t, z, y. 
| We remark that 
d2uy = + + weer , 
— dy) = wat + + wot + wey, 
where 
wr = + By) du + = Yu + Y4u) 
we = + (Ou + By) dv (mn = Bv + Bb) 
ws = 2(\’B — \B’) du + (VA — XA’) dv B 
and where we need not calculate w; and w.. Therefore tk 
4A(dt — — wey) — (wr + 44 + 4A d0)t = 4A(dR + os) 


— + 44 + 4A + 4A(R du + + dv + 
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isa linear combination only of z, 2. , 2v (and not of zu»). Our condition is satis- 
fed only if this expression and 2, y are not linearly independent,’ ice., if 


(19) dR = (Q+Q)R — ws — — Bu’) + — Ap’) 


where 


ButA A,+B 
and Q’ is given by the analogous equation deduced by changing A, B, into 
A’, B'. According to our hypothesis that R has at least two values, the condi- 
tion of the integrability of (19) must be identically satisfied; and therefore R 
has ©* values, depending only on one arbitrary constant. The first condition 


of integrability is 


dv, 


Q + Q = an exact differential. 


We find again a theorem already proved in the preceding pages. 

If one of the congruences (x, z) and (y, z) is W (that is, if Q or ’ is an exact 
differential), the other congruence also is W. By multiplying A, B (and conse- 
quently 4) by the same factor r, the point x does not change; and the point y 
does not change if we multiply A’, B’ by the same factor r’. If 2 + ©’ is exact, 
we can obtain (by a suitable choice of these factors) that 2 + 2’ = 0, and also 
that 


A, + By = A, + B’y = 0; 
(20) 


= ¢’, where + = 0; 


and, by supposing that these equations are satisfied, we find that the second 
condition of integrability is 
0 = + + + u'(B, + 


(21) 
+ — — (Buy + Bus) + (Apu + 


’ Our condition is equivalent to the equation 
4A(dR + w;) — 4A(Rdu+ 4d) 4A(R dv + 
0= 2A 2B 
2A’ 2B’ 


By expanding we find that in the right-hand member the coefficient of dR is — (44)? and 
the coefficient of R is 


+ 4dA + 4A — 4A(2uB’ — 2By’) du + 4A(2uA’ — 2Qy’A) dv = (4A)*(2 + 0”). 


From this remark we can deduce (19) immediately. 
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This can be seen by observing that by means of (20), equation (19) becomes 
equivalent to 
dR = + du + — du + (dws — do} 
+ 2{(uP’ — du + (uQ’ — do} 


where 


P= mA B(Ou» + By); Q A + By) — Bre, 


and P’, Q’ are given by quite analogous equations. The conditions of integra- 
bility of the projective geometry of the surfaces (loc. cit. p. 94) are equivalent to 


av + By)0. 


+ BY) u + Bre — 


Ou» + Bade + — + 
From these equations and from (20) we deduce: 
P = —(B, + Boy)» + + Bet; Q = (Au + Abu), + Boy, 
Qu — Po = —u(Ow + By) + 


and the analogous equations for P’, Q’, A’, B’, g’ = —¢y. From \ = w+ 
2(B, + Bé,) and the analogous identity for \’, u’ we get: 


ap — = — — Auby + Aube 
= Au + By + — 2(u’ + By + 
+ + — + BO,)u,; 
= 2° + + + + Bé,)]} 
= Qeolup’ + u(B, + B’0,) + u’(B, + + + Ag)» 
+ u(—At + A’, — A’y)» + (By + BOv)uy + (By + 
= P’ — we P — + Q + + 
By summing we find that dR is an exact differential only if 
0= { my’ + u(B, + B’6,) + (By + Bé,)} + + (Ag — Ab, — Au)ol 
+ — (Al. + + + + + B’O,) + (By + 
— — o(B, + BO.) + (By + BOv)u] — + + By + 


+ 2uilQ + (u + B, + Bé,)»|, 
and this equation is equivalent to (21). 
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I have not been able to find the geometrical meaning of (21) nor to study the 
evstem of equations (20), (21). We can:only remark that equation (21) is 
identically satisfied if ¢ = 0 [and also if only g, = 0; but in this case we can, 
by a suitable choice of the factors r and r’, make g = 0]. But if ¢ = 0, both of 
the congruences generated by the lines (2, z) and (y, z) are W-congruences. 
This remark gives therefore a new proof of Bianchi’s permutability theorem. 


5. We can also generalize some of the preceding results. Let us suppose 
that 3, is a congruence-transform of «' surfaces 2, , that the points ¢ correspond- 
ing to the same point z lie on a straight line, and that the corresponding planes 
7 tangent at ¢ to 2, form a pencil of planes. (These conditions are satisfied if 
the ,, as in the preceding section, are congruence transforms also of another 
surface 2,.) Changing notations, we suppose that u, v are asymptotic param- 
eters on Y,, and we calculate 6, y, 6 not for 2,, but for 2. We find that 
we can put 


t = px + 2(Az, + Br,), 
= ME + — 


where 
B A 
—A, — Ad, + By + BO, + TAB _ past Br, 


In these equations A, B depend on one constant c (because we have ©’ surfaces 
2,) and £ are the codrdinates of the plane tangent at x to 2,. If the points ¢ 
lie on a straight line, and the planes 7 form a pencil, we can obviously find four 
functions h, k, H, K such that 

uw =hA + kB, h = HA + KB; 


and these functions do not depend upon c. We will say that \, u are linear 
combinations of A, B; thence also \ + u, \ — «and consequently 


are such linear combinations. We can also find four functions n, m, p, o of 
«, » [which do not depend on c] such that: 


A 2 


Therefore the ratio A:B satisfies a system of Riccati equations. The points ¢ 
generate a range a on the plane é, the planes 7 generate a pencil 6 of planes pass- 


at 


| 
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ing through z. As the point z moves on S,, we obtain o* ranges a and »° 
pencils 6. All these ranges and all these pencils are projective to one another. 

In the case of Bianchi’s permutability theorem, this theorem is evident 
because the function R of §4 is given by R = f dR + c, and depends only on 
one additive constant c. 

We will say that «' congruences generate a pencil if they have the same 
first focal surface 2., if homologous points ¢ of the second focal surface », 
lie on a straight line, and corresponding tangent planes 7 belong to a pencil. 
We have demonstrated that r = A:B satisfy in this case a Riccati system. If 
Risa particular solution of this system, the most general solution will be given by 


where gq, s are functions of u, v and c is the arbitrary constant. Therefore 


q q 


gk q 
By comparison with (22) we find: 
Su R(s + Q)» 
qR B, q 
or 
Su = —qBR 
= + qyk 
whence 


(23’) (2). + (qBR), = qu, 8 = / | du + (2 


For each solution q of this equation we find 


2 Ru Qu ) (% ) 
= Br + }r B 


~(% _7)\24 (Re W157), _ 
Now let us calculate u. We find 


A’ BA B’ 


= + B(—r, + Br’) + | +40 + »| 


(24) 
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or 


R, qu ) ( Qu ‘ 
on —-— —~+28R})+ R= — BR 
R q 28 q B 


4 Be _ oF 
) +2/( + +27) 


For \ we get an analogous value, which we can deduce from the preceding by 
changing the sign of the two last members. 
(25) is the equation of the straight line composed of the points 


ux + + 2Bz,) 


whose local coordinates are uw, A, B. 

The analogous equation for can be considered as the equation of the pencil of 
the planes r. 

We transform now equations (23). Let R and R, be two solutions of the pre- 
ceding Riccati system; and, for instance, let us suppose that R [R,] corresponds 
to the value c = © (to the value c = 0) of the arbitrary parameter c. We put 
R = A:B, Ri = A,:B,, and choose the arbitrary common factors of A, B 
and the arbitrary factor of A; and B, so that 


_Ac+A,_A B 1 


shall be the most general solution of the Riccati system under consideration. 
In (23) we must therefore write 


ley 


Equations (23) are transformed into the following: 


0A 
+ yBi + 7B 


(26) 


+ BAi _ 


B, B 


If the congruence determined by R = A:B is W, it is known that we can sup- 
pose that the second members of (26) are equal to zero. In this case also the 


r. | 
ame = 
neil, 
If (25) | 
“i 
if 
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left-hand members will be equal to zero; and therefore also the congruence 
determined by R; = Ai: B; or by r = (Ai + cA):(Bi + cB) is a W-congruence, 
Therefore 

If one congruence of a pencil is W, all the congruences of the pencil are W. 
Two W-congruences always determine a pencil of congruences. The determination 
of a pencil of congruences, if a congruence of the pencil is given, depends on the 
system (26) of equations; and these equations are quite analogous to the equations 
upon which the theory of W congruences depends. 


6. We can also demonstrate the preceding results in another way. 

I. Let K be a congruence; the focal points x and y generate the two focal 
sheets 2, , 2, ; let and 7 be the planes tangent to 2, and 2, at two correspond- 
ing points x and y. The planes 7 are identical with the osculating planes of the 
edges of regression of a system of developables of K; and these edges lie on 2,. 
If a point x moves along such an edge, we have therefore 


= Do ndz = =0 
and consequently also 
xdn = dxdn = 0. 
Therefore one of the two systems of developables of K satisfies the equations 
> = 0, > dn dx = 0. 
It follows that >> dy dz is divisible by }> » dz, and we put 
dxdn _ dxdn 


(From >> nr = 0, it follows that > ndx = —)>>xdy.) In this equation 
Q is a Pfaffian, or a differential which can be exact or not exact. By interchang- 
ing the two focal sheets we obtain another Pfaffian 


Ay, z) = dy dé dy dé 
By calculation (loc. cit. p. 248) we can verify these theorems, and by a suitable 
choice of the factors of proportionality of x, y, £, 7 we find that 


Ay, = —QAa, y) 


and that © is the same differential 2 = Q(z, x) of (19’), §4. We get also that, in 
interchanging the focal sheets, the Pfaffian Q changes only its sign. If we change 
the factors of proportionality and substitute pz, cy(ec * 0) to 2, y, & is trans- 
formed into 


(edz + xdp)(odn + do) _ Li dxdn + 
>. n(xdp + pdz) Lads \e 


Qa, y) = 
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In other words, by changing the factors of proportionality, we can add to Q 
any exact differential. The last two theorems give all the possible indetermi- 
nations of @. We know (l.c.) that K is a W-congruence only if Q is an 
exact differential; and in this case (by a suitable choice of p, ¢) we can make 
0 = 0. 

Il. Let us now suppose that 2, , 2, are two surfaces whose points z, y are in 
one-to-one reciprocal correspondence, and that they determine a pencil of ' 
surfaces 2; ; every point ¢ of 2, lies on the straight line joining the corresponding 
points 2, y; and the plane 7 tangent to 2, at ¢ belongs to the pencil determined 
by the planes ¢, » tangent at z, y to 2,, 2,. 

We can also write 


and the equation )> rt = >> 7 dt = 0 become: 


dp +o >  ndz =0. 
It follows that 


dp _Lindz_ 


The right-hand member must be an exact differential; and, if R is a possible 
value of p, the other values of p are cR, where c is an arbitrary constant. And 
if we substitute Ry for y, we get 


III. Let us suppose that every surface 2, is a congruence transform of a 
fixed surface ,. The lines (2, t) generate a congruence K(z, ¢) whose focal 
sheets are Z,, D, ; and, as Z; varies, this congruence generates a pencil of con- 
gruences. If € is the plane tangent at x to =,, and if wu, v are curvilinear co- 
ordinates on our surfaces, we must have 


Oe, 2) = _ Liddy _ ie 
(z, x) ede rdu + sdv; ay, x) mdu +n 


At, x) = uaa = pdu + qd, 


where r, s, m, n are functions only of u, v, and p, q are functions of u, v,c. By 
substituting for ¢ its value z + cy, the last equation becomes: 


dt dz + dn = (pdu + Edz +c Edy) 


or 


(rdu + sdv) dz + c(mdu + > Edy 
= (pdu + qdv)[X + Edy]. 
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But >> édy and >> édz are not proportional (because the surfaces ¥, and 
y, are not identical). And therefore 

(r du + sdv) — (pdu + qdv) = cR Edy 

(pdu + qdv) — (mdu + ndv) = R > é dz, 


where R is a function of uw, », ¢. 
By summing we get 
(r du + sdv) — (mdu + ndv) = cR Dtdy + RD Edz. 


By derivation with respect to c, we obtain 


a(cR) aR 
+ Ede = 0; 


therefore R and Re do not depend upon c; and consequently R = 0. We get 
also 


rdu + sdv = mdu + ndv = pdu + qd, 
that is 
Q(z, x) x) a(t, x), 


and these equations give the theorem on the pencils of congruences we have 
demonstrated above (§5). 

IV. Let us suppose now that the surfaces 2,, 2., 2, are also congruence 
transforms of another surface 2, . From the theorem of §4 we deduce: 


Q(t, w) + Q(t, x) = 0. 


Also, if only one of the differentials Q(t, x), Q(t, w) is exact, all those differentials 
[which are all equal to Q(z, w) or Q(z, x)] are exact. This is precisely the first 
theorem of §1. 

V. Let us suppose now that the surfaces 2,, 22, 2, are congruence trans- 
forms not only of 2,, 2», but also of the surfaces 2; of the pencil determined 
by 2., Zw. We have in such a case 


0 = Ay, + Ay, J) = Ay, 7) + Ay, w) = Aly, w) + ALy, 2). 


Therefore all the 2 are equal to zero,’® and all the congruences are W congru- 
ences. This is precisely the second theorem of §1. 

In a preceding paper of mine, On a property of W-congruences (these Annals, 
vol. 41, no. 2 (April 1940) pp. 356-364) the following corrections are to be 
made on page 360. In line 3 replace A, + AO, by Az, + Bz»; also in line 3 
replace B, + Bé, by Az, + Bz,; and in line 6 replace x by y. 


INSTITUTE FOR ADVANCED Srupy. 


© We could also say that the preceding sums are exact differentials, and that conse- 
quently also the © are exact differentials. 
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ON CONTINUOUS MAPPING 
By Ne.tson Dunrorp 
(Received June 17, 1939) 


1. Introduction 


In this paper we are concerned with conditions which are sufficient to insure 
the existence of an interior point in the map f(G) of a non-void open set G under a 
1-1 continuous function f. The functions f actually considered in certain of our 
results form an extension of the notion of the class of 1-1 maps. The term 
distributive has been used to describe this more general type of function for the 
reason that every homomorphic’ map between metric groups with a one-sided 
invariant metric is an instance of such a function. In the case of continuous 
homomorphic maps on groups or continuous additive operations on a linear 
space it is not difficult to show that if the map of every open set contains an 
interior point then the mapping must be interior, i.e., take open sets into open 
sets; and this interior property of linear maps is of prime importance in the 
theory of linear operations. In the general case we are considering here one 
cannot hope for so much, for it is easy to give examples of 1-1 maps between 
complete separable and locally compact metric spaces which transform non-void 
open sets into sets with an interior point and yet are not interior maps. There 
are results of the type we are considering which may be readily obtained from 
certain very restrictive assumptions. For example if the range space satisfies 
the first Hausdorff axiom of countability and the domain space is regular and 
locally compact it is easily seen that any continuous map which transforms 
open sets into sets which are not non-dense must actually take every non-void 
open set into the sum of an open set and a non-dense set. This and other similar 
results which may be obtained from a compactness assumption have been 
omitted since they are comparatively trivial. In no place do we assume that 
the range space satisfies the first axiom of countability and in no place do we 
assume local compactness on either domain or range space. 

The paper divides itself into three parts. In the first (§§3, 4) we are concerned 
with placing certain results which are well known in the theory of metric spaces 
or in the theory of the neighborhood spaces of Hausdorff on a broader basis. 
These results center around the notions of category, the condition of Baire, and 
the operation A of Souslin. Since the relations between these notions are of 
considerable importance in many phases of topology we develop them here for a 
general type of neighborhood space in which the product of two open sets need 
not be open. Rather than define the closure operation by means of the neigh- 

‘Le. f(@ + y) = f(x) + f(y) if the groups are written additively. 
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| borhoods we introduce it postulationally but later in certain of the results in §5, 
uy assume that it is the ordinary closure determined by the neighborhoods. 

) The second part of the paper concerns itself with continuous functions f. 

In order to get an idea of what sort of conditions might be imposed in order that 

) the map f(@) of an open set contain an interior point we examine the known’ 

instance of a continuous homomorphic map of one complete metric separable 

group upon the whole of another such group. Here, as shown by Banach, the 

fy map must be interior. In this situation we see immediately (in view of the 

1 Baire category theorem) that every non-void open set G transforms into a set 

f(@) which is not non-dense. This condition, which is obviously a necessary 

one, is quite a long way from the desired conclusion, i.e., f(@) contains an interior 

point. In this case considered by Banach we can obtain considerably more as a 

starting point, namely that f(@) (@ open and non-void) is a Baire set of the 

H second category. This is because the map, being between complete separable 
spaces, takes Borel sets into analytic sets and such sets satisfy the condition of 
Baire. That f(G@) is actually a second category Baire set follows immediately 
BA iit from the Baire category theorem and the additive nature of f. The fact that 


i f(G) is a second category Baire set seems to be considerably nearer the desired I 

; i 7 conclusion than the statement that f(@) is not non-dense. These two observa- t 

i : | tions give us the clue used in §5. In the first two theorems of §5 we have as- f 

Ret) sumed something slightly less restrictive than the assertion that f(@) is a second 

f i if category Baire set. Under such an assumption on f very little need be assumed 0 
P ely 2 about the domain and range spaces. In Theorem 5.5 we have assumed only ¢ 

that f(@) is not non-dense. In this situation we have had to restrict the spaces t! 


concerned considerably. The range space is assumed to be a Hausdorff space 


i . and the domain space locally complete and locally separable. It might be ( 
oe mentioned here (and also used as an example showing the necessity of the 1-1 . 
f | assumption in Theorem 5.1) that a continuous function y = f(x) may be defined N 
on a separable metric space X and have for its range the whole of the line segment 
0 < y S 1 (with its usual topology) and take every open set into a Baire set of 
second category and still have the interior of f{(S) empty for every sufficiently 
small sphere S. To construct the space X we may take two unit line segments 
in the plane each directly above the segment (0, 1). On one segment remove all (3 
rational points and on the other remove all points of another denumerable dense D 
set which contains no rationals. The metric in X is the usual distance in the. 
plane and the function y = f(z) is a projection of x onto the segment (0, 1). ( 
In the last part of the paper, §6, an application is made. I 
Finally we wish to point out that the method of proof used in Theorem 3.1 is ‘ 
due to Sierpinski.’ 
2S. Banach, Uber metrische Gruppen, Studia Math., vol. 3(1931), pp. 101-113, in particu- N, 


lar Theorem 8. 
’ W. Sierpinski, General Topology, University of Toronto press (1934), p. 194. 
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2. Notation, axioms, and definitions 


Throughout the paper we shall use the following notation: X = (z) is a set 
of points, B(X) is the power set of X, ie., the family of all subsets of X, | 
XC B(X) is a family of sets called neighborhoods, E called closure of E, is a 
function on B(X) to P(X). The symbol N with or without subscripts or 
superscripts will always denote a neighborhood. A set whose closure contains 
no N is said to be non-dense. Later we shall restrict the family N and the 
closure operation E according to a practise in common use but for the present 
the following less restrictive axioms are sufficient. 


NC P(X) 

(i) NeNimplies N ¥ 0, not void. 

(ii) NiNe ¥ 0 implies the existence of an Ns C NiN2. 
B(i) Any subset of a non-dense set is non-dense. 

(ii) N C E implies N C NE. 

(iii) The void set ts non-dense. 

It should be noted that the N3; in A(ii) is not required to contain a specified 
point of NiNe. In fact, until §5, we shall find it convenient not to mention Ah 
the points x of X but deal entirely with the sets EF « $(X). For this reason a | 
few of the definitions to follow may sound a bit unusual. 

The first small greek letters a, 8, y, 5, €, will be used for ordinals and the last 
ones ¢, x, ¥, w, when written without primes, for cardinals. The axiom of 
choice will be used throughout the paper and the symbol ¢’ will always mean 
the smallest ordinal corresponding to the cardinal ¢. . 

The family of sets EZ in $(X) which can be expressed as the sum of ¢ non- ; 
dense sets will be denoted by J, , and we define JJ, = $(X) —1,. Aset E 
is said to be locally I, if for every non-void subset E’ of E there is an N with 
NE' # 0, NE ely. 


3. Category theorems and the class %, 


3.1: If E is locally I, then 
Well-order all N having the property that NE ¢1, ; ' 


Define 
(3.2) Mi=NiB, M.=NeE- DNs. 

<a 


It follows from B(i) that subsets of I ¢ sets are also J, sets and hence we may 
Write 


M, = Mz where M2? is non-dense. 


Now define 


EY = >) M2. 
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It will first be shown that 
(3.3) E= > E’. 


Obviously E D> Now if = E — E” ¥ O there is an N 
with NE, ~ 0, NE«I,. Thus let N, be the first in the series (3.1) such that 
NE, ~ 0. Then 


M. =.N.E > N.E: 0, 


so that 
0+ = : N.E:M 


Hence there is a y < ¢’ such dak: fortiori ~ 0, 
N.E; Dov<s' E” # 0, which is a contradiction to the definition of E,. This 
establishes equation (3.3). It remains therefore to be shown that the set 
E’(y < ¢’) is non-dense. Suppose on the contrary that for some y < 4’, 
E’ DN. By B(i) then N C. NE’ which. shows (using B(i) and the fact that 
the void set is non-dense) that there is some a for which NN, # 0. Let N, 
be the first in the series (3.1) such that NN.,# 0. There is by A(ii) an N, C 
NN, and we have N,N. = 0 (a < 6), afortioriN,M,.=0(a<e). But from 
(3.2) it follows that N.M. = 0 fora > eand thus N,M, = Ofora>e. Thus 


and 
(3.4) N,E’=N, =N,M7C M?. 
* 
Now 


N, CNN. CN C E" 


and therefore N, C N,E’, a contradiction to B(i), (3.4), and the fact that 
M? is a non-dense set. This completes the proof of Theorem 3.1. 
_ For an arbitrary set E we define E} as the sum of all subsets e of E for which 

there exists an N De such that NE eI,. 

THEOREM 3.2: For an arbitrary set E the set E; isals set. 

In view of Theorem 3.1 it is sufficient to show that E; is locally I,. Let 
0 ~ e’ CE. There exists an e (one of the summands in Ey = dy e) such 
that ee’ 0. Thus some N Deand NE eI,. So Ne’ ¥ O.and NE, , being 
a subset of NE, must be a I, set. 

A set E is said to be locally IT, on a set e if for every N with Ne # 0 we 
have NE ell,. 

THEOREM 3.3: If is transfinite and if E ¢ II, there is an No eR such that E 
is locally II, on No. Furthermore if one assumes that the sum of two non-dense 
sets is non-dense then this is true also for @ finite. 
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For some No we have 
(3.5) ES >No, 


for if not, H — E; is non-dense and by Theorem 3.2 E is either non-dense, 
which is impossible, or else EH = E — Ey + E> is the sum of @ + 1 non-dense 
sets. If @ is transfinite then ¢ + 1 = ¢ and EeTI,, a contradiction. Also 
if @ is finite and the sum of two (and thus any finite number) of non-dense 
sets is non-dense then F is non-dense, which again is impossible. Thus we 
may assume the existence of an No satisfying (3.5). Now suppose NNo # 0 
and pick N; C ; then 


0#N,CNN, CN CE — Ei, 
and so 
N, C NE — E%). 
By B(iii) then 
0 Ni\(E — E3) CN(E — E>). 


Since N(E — E3)E; = 0 there is no subset e of N(E — E>) having the prop- 
erty that there exists an N’ D e with N’EeI,. In particular, since N > 
N(E — E3), we have NE ¢ II, which shows that E is locally IJ, on No. 

Any set which is either void or the sum of neighborhoods will be called open 
and the complements of open sets will be called closed.‘ The letter G, [§] will 
be used for the family of open [closed] sets. The symbol %, will be used for 
the family of sets Z having the property that N — E eI, whenever E is locally 
II,on N. From this point on we shall assume 
C Either $ is transfinite or the sum of two non-dense sets is non-dense. 

THEOREM 3.4: The family 84 has the following properties. 

(i) If Ea (a < ¢’) isa By set is Dracy: Ea- 

(ii) The complement of a By set is a By set and the product and difference of two 
B, sets are By sets. 

(iii) Every open and every closed set is a By set. 

(iv) For an arbitrary set E there is a set B ¢€ By covering E and such that if B’ 
is any By set covering E then B — B' ely. 

To prove (i) suppose 
(a) Dd Ee is locally I, on N. 

We have to show that N — Dvacy EaeIg. If this were not the case there 
would be (using C and Theorem 3.3) an No such that 


(b) N — > Eais locally II, on No. 


‘ Note that the closure Z of a set E is not necessarily a closed set. 
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From (b) and C, NNo # 0 and from (a) No Loa<e’ E.eIl,. Thus there is 
an a such that No#., € 114 and (Theorem 3.3) an No such that NoE,, is locally 
II, on No. Then, a fortiori, 


(c) E,, is locally IT, on No. 


Since Ea, €%4 we have from (c) that No — E.,¢I,. But No — a 
No — Doacg Ea and thus 
No- 
a<¢’ 
which, in view of the fact that NoNo ¥ 0, is a contradiction to (b). This 


completes the proof of (i). 
To prove (ii) let N be such that 


(d) *X — Eis locally IJ, on N. 


We wish to conclude that the sett NE = N — (X — E)isal,gset. If NE ell, 
there is an No such that NE is locally IZ, on No and thus E is locally ITZ, on No. 
Since EF «8,4, No — EI, which is a contradiction to (d) and the fact that 
NN, # 0. The rest of (ii) follows immediately from (i) and the formulas 


ED = X — [((X — E) + (X — D)], 
E-—D = E(X — D). 


To prove (iii) let F be a closed set so that F = X — G where G is an open 
set and let N be such that 


(e) F is locally IJ, on N. 


Then if N — F. = NG@eIJ¢, it is not empty and there is an N’ C NG; thus 
by (e) the void set FPN’ e II, which is impossible. Thus closed sets are B, 
sets and by (ii) open sets are also. 

To prove (iv) let E be an arbitrary set and let G be the sum of all N such 
that NE eI,. Place 


F=X-G, B=F+GE, 


then to see that the covering B of E is a By set it is sufficient in view of (i), 
(iii), and the fact that J, C By, to show that GE eI, ; and in view of B(iii) 
and Theorem 3.1 it is sufficient for this to show that GE is locally I,. This 
last fact is an immediate consequence of the definition of G. Now let B’ be an 
arbitrary 8, covering of E, G’ the sum of all N such that NB’ eI,, and F’ = 
X —G’. Thus G@’ CG, F’ DF and 


B— B’=F+GE—B' =F —B CF’ 


To complete the proof of (iv) it is sufficient then to show that F’ — B’ ¢1,. 
Suppose on the contrary that F’ — B’ «II, and thus F’ eII,. Let F’ be 
locally IZ, on N; then if N € F’ there is an N’ C NG’, and thus 0 = N’F’ eI, 
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which is a contradiction to B(iii). Thus F’ DN. It follows readily from the 
definition of G’ that B’ is locally I7, on F’ and thus since B’ ¢€ By we have 


(f) N(F’ — B’) eI, for every N CF’. 


Now since F’ — B’ was assumed to be JJ, there is an No such that it is locally 
II, on No, and thus in view of (f), MoG’ # 0. It follows that there is an 
N’ C N,G’ and since F’ — B’ is locally IJ, on No we have the null set 
0 = N'(F’ — B’) II, , a contradiction to B(iii). This completes the proof of 
Theorem 3.4. 


4. The operation Az, 


Let ® be the set of all denumerable sequences a = {a,} of ordinals < ¢’, i.c., 
n=1,2,--- implies a, <9’. For two different sequences a = {an}, a’ = fan} 
in ® let the distance between a and a’ be defined by the formula 


(a, a’) = 1/mfa, a’), 


where ma, a’] is the first integer m for which a, + a». This metric of Baire 
makes the space ®, as is well known (and readily verified), a complete metric 
space. Suppose now that for every a = {a,} in a certain subset &) of @ and 
every integer n there is a set Ka,...«, Which, as the notation indicates, depends 
merely upon the finite subsequence a;, az, --- ,@, of a and is thus not an 
arbitrary function of a and n. Such a function K = Kaq,...a, is called a kernel 
over). For every kernel K over ®9 the operation 


Ae = 2 K,,.--« 
ae n=1 
will then be defined. If Dt C B(X), by Ae,[Me] will be meant the family of all 
sets Ag,[K] where K = Ka,...a, is a kernel over &) with values in M. 
TuroreM 4.1: If ¢ = No and 4 is the sum of at most ¢ closed sels in & then 
is an invariant of the operation Ag, , 


Ae = Bo. 


While it follows from the theorem that By is an invariant of A», for every 
open set &y and every F, set &) in ® we do not know if this is true if ®p is, for 
example, an arbitrary Borel set in &. 

; Before proving the theorem we would like to point out that if » is closed 
and a = {an} — then there is an m = m(a, such that for every 
b = {8} €% one of the inequalities 


Bn Qn n= 1,2,---,m 


holds. This shows that a kernel K over a closed may be extended so as 
to be defined over all of & and satisfying the following conditions 
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(i) The extended kernel K’ takes on no values that K does not assume with 
perhaps the exception of the void set. 


(ii) Ag», |K] = Ao[K’]. 
To define | on we place — = 0 if there is no sequence in 4 starting 
with a, a2, ,a@, Otherwise place = Kg,...a,. Since is closed 


we have for every a e® — % the product []%-1 K%,..-«, void, and thus (ii) is 
true. As the reader will see presently the theorem when > is the sum of ¢ 
closed sets is an immediate corollary of the case where % is closed. Thus by 
the above remarks it would be sufficient to prove the theorem for the single 
case ®y = &. This can be proved by very slight modification of methods already 
known.’ Rather than use this argument we prefer to give a complete proof 
here for an arbitrary %) since this attack will bring out more clearly the one 
place in the proof where an assumption on ® is needed and may therefore 
throw some light on the general case of ) = an arbitrary Borel set. 

Proceeding now to the proof of the theorem we let K be a kernel over % 
with values in By , where » will for the time being, be subject to no restrictions. 
It may be supposed that fora = {an} €®y) we have 


(4.1) a=1,2,..., 


for if this were not true the kernel 
K’ = = 
m=1 


has the property of being regular expressed by 4.1, has, by Theorem 3.4 (ii), 
its values in By, and Ag,[K’] = Ae,[K]. 
By Theorem 3.4 (iv) there is a set B e By such that 


(4.2) A»,[K] C B, and 
(4.3) If B’ «By and As,[K] C B’ then B — B’ ely. 


For b = {B,} €% andi = 1, 2, --- let &o(b, z) be the set of a = {ap} in & for 
which 


Again by Theorem 3.4 (iv) there is for each 6 = {8,} €&) and each 7 = 1, 2, --- 
a set Bg,...s,¢€Bs depending only upon the finite sequence f,, --- , 6: and 
such that 


a¢ $o(b,t) 
and 


(4.5) if B’e®,and I] B’, then Bg,...9, — B’e Is. 


ae Oo(b,i) n=1 


5 See, for example, Kuratowski, Topologie I, Warsaw 1933; pp. 56-57. 
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It may be supposed that 
(4.6) Bg,...8; © Ka,...8, for bet, ¢=1,2,.--, 


for in view of 4.1 and the fact that the product of two By sets is a By set, the 
set Ks,...2;Be,...8; Satisfies all the conditions imposed on Bs,...,. Since by 4.2 


Ae = B — (B — 
and B is a By set it suffices to show that B — A»,[K]eZ,. Using 4.6 


(4.7) B- A», |K] = B- > II > i, 


i=l be i=l 


For 6 = {B,} €&), andz = 1, 2, --- let (6, 7) be the set of ordinals y < ¢’ 
for which there is a sequence of the form 


(Bi , Be, Bi, ¥, 


For i = 0 the symbol (6, 7) is taken to mean the set of all ordinals y < ¢’ for 
which there exists a sequence (y, --- ) starting with y and belonging to %. 
With this notation in mind we wish to establish the following inclusion: 


be Oo i=1 be i=0 € (b, 
where Bg,...2, = Bfori = 0. This inclusion will now be established under the 
assumption that ®p is a closed set in ® and the theorem will be proved for this 
case first. 
Suppose z is a point of B but does not belong to the right side of (4.8). Then 
for every b ey we have the following statement holding. 


(4.9) If 2 € Bg,..., there will be a y € (6, 7) such that x € Bg,...8;. 


Since it is known that zx e B we get, upon taking 7 = 0 in 4.9, a 7; € (6, 0) such 
that re B,,. Since y; € (6, 0) there is a sequenve 6; = (y1,---) in®. For 
b = 6, we have, upon setting 7 = 1 in (4.9), a yz in (6; , 1) such that ze B,,,, . 
Since y2 is in , 1) there is a sequence be = y2, ) in and we can 
repeat the argument and arrive at a sequence 


C= (1, 72,73, ) €®, € (n = 1, 2,---) 


and the sequence ¢ has the property that for every integer n there is a point 
Q, €% of the form an = (y1, 72, Gnti, ). Since it is assumed 
that is closed in the Baire metric of & we see that the sequence c belongs to %o 
and that consequently 


> Il Ba,...8; ’ 


i=1 


which establishes the inclusion (4.8). 
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Since as b ranges over ®p there are at most ¢' = osets 


(4.10) Boy. 24 


€ 7) 


there will be at most Nod = ¢ sets in the » > on the right side of (4.8). Hence 
to show that B — Ag,|K] J, it suffices, in view of (4.7) and (4.8), to show 
that 4.10, isa J, set. Now if 


(4.11) te IT 


ae Po(b,t) n=1 


there is a sequence a = {a,} such that 


(4.12) b = Be, Bi, 1, EB, 
and 
(4.13) a=1,2..., 


Let 6 be the sequence in (4.12), then upon using (4.4) with 7 replaced by 7 + 1 
we have 


a’ n=1 


Using (4.12) and (4.13) we see that a’ = (a2, a3, --- ) €& (6, 7 + 1) and thus 


from (4.11) and (4.14) that 2 e Bg,...8;2, , which proves that 


ae n=1 € (6,4) 
Since there are at most ¢ terms in the sum Eevee Bg,...8;y and each term is a 
By set we have by Theorem 3.4 (i) and (4.15) the right to use this sum as B’ 
in (4.5) (or (4.3) in case 7 = 0) and thus conclude that (4.10) is an J, set. Thus 
it has been shown that if ) is closed and the kernel K over © has its values in 
then A@,[K] By and so for closed , A,[Bs] C By. On the other hand 
it is obvious that As,[B,] D B,. This establishes the theorem for closed 
sets). In view of Theorem 3.4 (i) and the formula 


p TI II Kz,..-2, 


ae D n=1 n=l 


it follows immediately that A»,[B4] = By for any set ) which is the sum of at 
most ¢ closed sets. 


5. Continuous functions 


Throughout this section we shall be dealing with two spaces X, X, and con- 
tinuous mappings f(e) on $(X) to B(X;). Our chief interest will lie in the 
discussion of conditions sufficient to insure that the interior of the image of 4 
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neighborhood is not empty. In the first theorems to follow there is very little 
assumed explicitly about the spaces X, X;. What gives the hold on the situa- 
tion in these theorems is the assumption involving implicitly X, X,, and f 
and asserting that for every N eM there is a cardinal number ¢(N), finite or 
transfinite, such that f(N) ¢ Bg~mJZ ew). In the last part of this section we 
investigate conditions under which the existence of such a function ¢(N) can 
be established. 

By the ordinary closure E of a set E we shall mean the sum of all sets e having 
the property that Ne ~ O implies NE # 0. We shall find it necessary at times 
to restrict the space X or X, or both more than has heretofore been done. For 
this reason we give here modified forms A(ii)’, B’, C’ of A(II); B, and C with 
the understanding that when any of the conditions from A(o) to C’ are assumed 
this fact will be explicitly stated in the theorem. 

A(ii)’ The product of two open sets is open; where, as defined above, an open set 
is either the void set or a sum of neighborhoods. 

B’ ‘For every set E we have E = E. 

C’ The sum of two non-dense sets is non-dense. 

It is readily seen from the conditions A that the ordinary closure E satisfies 
the conditions B so that when one assumes A(o), A(i), A(ii) and B’ it is never 
necessary to assume B(i) or B(ii). Also C’ is implied by the conditions A(o), 
A(i), A(ii)’ and B’. The condition A(ii)’ may be stated in the equivalent form: 
A(ii)'’ If NN’E # 0 there is an N” such that N" C NN’ and N"E # 0. 

It is in this form that we shall most frequently use it. The final condition 
which will sometimes be used is 
R For every N there is an N’ such that N’ CN. 

Aset function f(e) on $(X) to B(X;) is said to be almost 1-1 in case of f(N)f(N’) 
is non-dense, whenever N, N’ are neighborhoods whose product is void, and it 
is said to be continuous in case for every e in B(X) and N, e Ni satisfying the 
conditions 0 # e, f(e) C N, there isan N eR such that Ne ¥ Oandf(N) CN,. 
The set function f(e) is said to be monotone if e C e’ implies f(e) C f(e’). Many 
of the properties of set functions which can be defined by point functions are 
not needed and will not be assumed. Thus for example, we do not assume 
that f (> éa) = bu f(en). It is assumed however in the last part of Theorem 5.1 
that subsets of the image f(E) of E are images of subsets of Z, a property 
— by every set function which is defined by a point function in the usual 
ashion. 

TuroreM 5.1: Let the space X with its topology N satisfy A and let the space X, 
with its family N, of neighborhoods and closure operation satisfy the axioms A, B, 
and C’. Let f(e) be a continuous monotone almost 1-1 set function with domain 
R(X) and range’ which satisfies the condition 


(i) IMC By 


* The range is the set of functional values. 
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Then 
(ii) For every N €Q there is an N; «MN, such that f(N) D Ny. 

If in addition it is assumed that the space X satisfies the regularity condition R, 
the space X, satisfies the condition B’, and the function f satisfies the condition 
S(P(E)) = BS(E)); then the three statements (i), (ii), and (iii) are all equivalent. 
(iii) For every non-void open set Gin X, f(G) = G; + E, , where G, is a non-void 
open set in X, and E, is a non-dense set which is closed in f(G). 

Let o(N) on be such that for every N eR. For each N 
there is by Theorem 3.3 an N, e Nt such that 


(a) S(N) is locally IT 4.x) on Ni 
(b) E, = — f(N) eI 


i Let E C X be such that f(ZE) = E,. It will now be shown that E CN. If 
there is a point z eZ — WN then there will be a neighborhood N(x) of z such 
that N(x)N = 0, f(N(x))f(N) is non-dense. Since f(x) ef(F) = E, CN, 
it may be supposed, using continuity and A(ii), that N° has been chosen so that 


(c) f(N®) CNi, —f(N*)f(N) is non-dense, C N(z). 
Now (ec) and (b) show that f(N°) C FE, + f(N*)f(N), and thus that f(N®) Ig. . 


Hence 

(d) (N°) < 9(N). 
{ bi i | | Using Theorem 3.3 and A(ii) on the space X; we can find an Nj ¢ 3t; such that 
Ni CN, and 

| (b’) Ni — f(N) . 


Since N; C Ni we have by (a) and (c) the set f(N)[Ni — f(N*)] = 
Nif(N) — f(N*)f(N) € II , and so Ni — f(N°) must be a II set. This 
shows in view of (b’) that ¢(N) < ¢(N°), which is a contradiction to (d). Thus 
we can say that E CN. Now N, CE, + f(N) and E, = f(E) Cf(N) and so 


N, Cf(N) + f(N) = f(N). 


This establishes the first conclusion of the theorem. i 
Since an open set G, and a non-dense set E, both belong to By for any ¢, . 
finite or transfinite,’ and for sufficiently small ¢ (¢ finite for example) b 
G, + E, e II, we see that (iii) implies (i). Since it has already been shown that : 
(i) implies (ii) it will be sufficient in proving the second assertion of the theorem 
to prove that (ii) implies (iii). / 
; 7 Theorem 3.4 (ii). Here C(i) must be satisfied since we have assumed that & = £ in 
if X,. Also we have assumed C(ii)’ in X, . A 
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It is clear that if X satisfies R we have for every non-void open G an N, € 2; 
such that N; C f(@). Let G, be the sum of all N; C f(@) and consider the set 


E, = f(@) —G. 


If not closed in f(@), i.e., if 0 E,f(@) — E, CG,, there is a neighborhood 
N, CG, (and hence Ni, = 0) such that N,[E,f(G) — E,] ¥ 0, so that NB, + 0 
and N,E, = 0, which is a contradiction to the definition of closure. 

If E, contains a neighborhood N,; we must have G:N, = 0 for otherwise 
there would be a neighborhood N : such that 


Ni CNG, 
Ni 
Mi Cc = 


which is a contradiction to the definition of H,. Hence we can say that if 
DN, then N,G, = 0. Let N, C FE, s0 that 0. Pick e C G so that 
fe) = NiE,. By continuity there is an N’ e N such that N’e ¥ 0, f(N’) CN,, 
and since e C G we have 0 # N’e = N’Ge. There is then an N” C N’G and 
so f(N’”) Cf(N’) CN,. But there is an Ni such that 


Ni Ni CM, 


and so G,N; = 0, a contradiction to A(i). Thus F, contains no neighbor- 
hood and the proof of Theorem 5.1 is complete. 

There are functions other than almost 1-1 functions for which conclusion 
(iii) of Theorem 5.1 holds, at least when applied to metric spaces. To illustrate 
a class of such functions we state the following definition. A function f defined 
on a metric space X and with values in an arbitrary class is said to be distributive 
in case for every positive number p there is a positive number qg such that if z, 
2’, N have the properties® 


(a) reNeMN,diamN <q, f(x) = f(r’) 
there will be an N’ such that 
(b) diam N’ f(N) =f(N’). 


Thus any 1-1 function will be distributive. Also any linear mapping between 
linear spaces will be distributive providing the metric in the linear spaces 
satisfies the condition (2, y) = (x — y, 0). Also any homomorphic mapping 
between metric groups will be distributive in case the groups have a left (or 
right) invariant metric. 

Turorem 5.2: Let X be metric and X, satisfy’ A, B’, and C’. Let f be a 


* The family 2 in the case of a metric space is taken as the family of all open spheres. 
A pi since C(ii)’ is implied by A, A(ii)’, B’ we may assume instead that X, satisfies A, 
nl)’, B’. 
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continuous distributive function with domain X and range X, which satisfies the 
condition 


{MN C 


Then for every non-void open set G in X we have f(@) = G, + E, where G, is a 
non-void open set in X, and E, is a non-dense set which is closed in f(G). 

For each 2, in X, let E,, be the set of all x « X for which x, = f(x). The 
sets E,, (a, « Xi) are then disjoint subsets of X and there is by the axiom of 
choice a set Xo C X such that for each 2, the set E,,Xo consists of a single 
point. The function fo(%) = f(x) defined on Xo is then a 1-1 map of X, 
into X,. Any set of the form fo {(N) where N eX (i.e., N is an open sphere) 
will be called a neighborhood No in the space Xo. For each N in 9 there will 
be then one and only one neighborhood No C Xo such that fo(No) = f(N) = 
f(No) but for a given No there may be many neighborhoods WN for which this 
equality holds. If 9% is the family of all neighborhoods No C Xo then it is 
clear that fo(Mo) CD54 BeIT, , and fo(ao) is a continuous mapping of Xp into X,. 
We wish now to establish certain properties of the Jt topology. It is clear 
that A(o), A(i) are satisfied. The topology %% also satisfies the condition 
A(ii)’ for if 2 ¢ NoNo where f(No) = f(N), f(No) = f(N’), there will be points z, 
x’ in N, N’ respectively such that f(a) = f(x) = f(x’). In view of the dis- 
tributive character of f there will be neighborhoods M, M’ of z, x’ respectively 
(M, M’ e%) such that M CN, M’ CN’ and f(M) = f(M’). Thus the neigh- 
borhood No = fo'f(M) = fo'f(M’) has the properties No Mo, C 
N.No. We wish next to show that the topology No satisfies R. Let No = 
fo f(N). Let x be a point of N and p > 0 be such that the open sphere 
S(z, 2p) with center x and radius 2p is entirely within N. Let M = S(z, p) 
so that M CN. Define My = fo'f(M), so that My e No. Let 29 € My where the 
closure is taken with respect to the No topology; i.e., every No of 2 intersects My . 
Let N(a) with diameter N(a) < gq. Let No(%) = fo f(N(2)) so that 
No(20) € No and 2 € No(a%). Then 0, and hence f(N(a))f(M) = 
fo(No(20))fo(Mo) # 0. There is then a y ¢ N(a) and a z eM such that f(y) = 
f(z). Let N’ «® be a neighborhood of z with diameter N’ < p and such that 
S(N’) = f(N(a)). Now for any 2’ ¢ N’ we have (2’, xz) S (2’, z) + (2, 2) < 
p + p = 2pso that N’ CN. This shows that 2 efo f(N(ao)) = fo f(N’) 
fo'f(N ) = Ny. Since x was an arbitrary point in the MN) closure of Mo we 
have My C Np and the topology Np) satisfies the condition R. The desired 
conclusion now follows immediately by applying Theorem 5.1 to the set function 
determined by fy on Xo to X. 

Our attention will now be turned to the problem of finding when the hypothesis 


IMC 2, 


holds for every continuous function f which takes neighborhoods into sets which 
are not non-dense. We have not been able to settle this question for any case 
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except the one where X is a complete metric space. One condition sufficient 
for the validity of f(M) € do4 Bell, is of the form s(x) < c(x) where s(x) and 
c(z) are numerical invariants of the point x called the separability character of 
the space X at the point x and the category character of the space X at the point x, 
respectively, which we now proceed to define. If a set H contains no points 
which are isolated in X then there is a smallest cardinal number c(£), called 
the category character of E, such that E is the sum of c(E) non-dense sets. If E 
contains a point x which is isolated in X (i.e., the set consisting of the single 
point z is an open set) then we write c(z) = wu, and read this as c(Z) is undefined. 
If for every neighborhood N of x we have c(N) = u then we put c(z) = u and 
otherwise define c(x) as the greatest lower bound of the numbers c(N) where N 
varies over the neighborhoods of x. The number c’(z) is then the first ordinal 
in the set c’(N) and the value c(z) is attained by the function c(N). By the 
separability character of an open set G in X we shall mean the smallest cardinal 
s(7) such that there exist open sets 


(a) G,, Ge, +--+ a < 


forming a basis for G;i.e., every open subset of G is the sum of the G,’s contained 
init,andG, CG. The number s(z) is then defined as the greatest lower bound 
of the numbers s(N) where N varies over the neighborhoods of x, and as before 
it is seen that s’(x) is the first ordinal in the set s’(N) and that s(x) is attained 
by s(N) for some neighborhood N of x. It is clear then that for every x in X 
there is a neighborhood No of x such that if N is any neighborhood of x which is 
contained in No we have s(x) = s(N), c(z) = c(N). We shall agree to write 
s(x) < e(x) for every x with c(x) = u. 

Turorem 5.3: If X is a metric space which is locally complete and locally 
separable then s(x) < c(x) for every x in X. 

This is nothing more than the well known category theorem of Baire applied 
to the case in hand. 

Unfortunately the locally separable spaces are, as far as we know, the only 
ones which satisfy the condition s(x) < c(x), and so what we have to say is 
essentially applicable only to these spaces. The range space X, , however, will 
be left quite arbitrary. A space is said to satisfy the condition (H) in case 
(H) for every pair x, x’ of points with x ~ x’ there are neighborhoods N, N’ such 
that x ¢N, N’, NN’ = 0. 

Turorem 5.4: Let X be an arbitrary” complete metric space and G an open 
sel of X. Let f(x) be a continuous function on X to X, where the space X, satisfies 
the conditions A, B, C’ and H. Then 


€ Baw 


If s(@) is finite G consists of a finite number of isolated points and since the 
theorem is easily verified in this case we shall assume that s(@) = No. 


” Le., we do not assume local separability or s(x) < ¢(z). 
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continuous distributive function with domain X and range X; which satisfies the 
condition 


{MC . 


Then for every non-void open set G in X we have f(G) = G, + E, where G, is a 
non-void open set in X, and E, is a non-dense set which is closed in f(G). 

For each 2x; in X, let E,, be the set of all x « X for which x, = f(z). The 
sets E,, (x: « X:) are then disjoint subsets of X and there is by the axiom of 
choice a set Xo C X such that for each 2 the set H,,Xo consists of a single 
point. The function fo(%) = f(x) defined on Xp is then a 1-1 map of X, 
into X,. Any set of the form fo f(N) where N ¢€ ® (i.e., N is an open sphere) 
will be called a neighborhood No in the space Xo. For each N in & there will 
be then one and only one neighborhood No C Xo such that fo(No) = f(N) = 
f(No) but for a given No there may be many neighborhoods N for which this 
equality holds. If 9) is the family of all neighborhoods No C Xo then it is 
clear that fo(Mo) C , and fo(ao) is a continuous mapping of Xp into X,. 
We wish now to establish certain properties of the Jt topology. It is clear 
that A(o), A(i) are satisfied. The topology Yt also satisfies the condition 
A(ii)’ for if x ¢ NoNo where f(No) = f(N), f(No) = f(N’), there will be points z, 
x’ in N, N’ respectively such that f(a) = f(x) = f(x’). In view of the dis- 
tributive character of f there will be neighborhoods M, M’ of x, x’ respectively 
(M, M’ «N) such that M CN, M’ CN’ and f(M) = f(M’). Thus the neigh- 
borhood No = fo'f(M) = fo'f(M’) has the properties No €%o, to¢No C 
NoNo. We wish next to show that the topology MN satisfies R. Let No = 
fo f(N). Let x be a point of N and p > 0 be such that the open sphere 
S(x, 2p) with center x and radius 2p is entirely within N. Let M = S(z, p) 
so that MCN. Define My = fo 'f(M), so that My «No. Let 20 Mo where the 
closure is taken with respect to the No topology; i.e., every No of 2» intersects Mo. 
Let N(a) with diameter < q. Let No(ao) = fo so that 
No(xo) € No and 2 € No(ao). Then No(ao)My # 0, and hence f(N(a))f(M) = 
fo(No(xo))fo(Mo) # 0. There is then a y ¢ N(a) and a z eM such that f(y) = 
f(z). Let N’ eM be a neighborhood of z with diameter N’ < p and such that 
S(N’) = f(N(a)). Now for any 2’ € N’ we have (z’, xz) S (2’, z) + (2, 2) < 
p + p = 2pso that N’ CN. This shows that a efo f(N(2)) = fo f(N’) © 
fo'f(N) = Ny. Since 2» was an arbitrary point in the MN) closure of Mo we 
have My C No and the topology Np satisfies the condition R. The desired 
conclusion now follows immediately by applying Theorem 5.1 to the set function 
determined by fy on Xo to X,. 

Our attention will now be turned to the problem of finding when the hypothesis 


BolT, 


holds for every continuous function f which takes neighborhoods into sets which 
are not non-dense. We have not been able to settle this question for any case 
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except the one where X is a complete metric space. One condition sufficient 
for the validity of f(M) © dos BolT, is of the form s(x) < e(x) where s(x) and 
e(z) are numerical invariants of the point x called the separability character of 
the space X at the point x and the category character of the space X at the point x, 
respectively, which we now proceed to define. If a set HZ contains no points 
which are isolated in X then there is a smallest cardinal number c(£), called 
the category character of E, such that E is the sum of c(E) non-dense sets. If E 
contains a point x which is isolated in X (i.e., the set consisting of the single 
point x is an open set) then we write c(z) = u, and read this as c(Z) is undefined. 
If for every neighborhood N of x we have c(N) = u then we put c(z) = u and 
otherwise define c(x) as the greatest lower bound of the numbers c(N) where N 
varies over the neighborhoods of x. The number c’(z) is then the first ordinal 
in the set c’(N) and the value c(x) is attained by the function c(N). By the 
separability character of an open set G in X we shall mean the smallest cardinal 
s(7) such that there exist open sets 


(a) G,, Ge, a < s'(G) 


forming a basis for G; i.e., every open subset of G is the sum of the G,’s contained 
init,andG, CG. The number s(z) is then defined as the greatest lower bound 
of the numbers s(N) where N varies over the neighborhoods of x, and as before 
it is seen that s’(x) is the first ordinal in the set s’(N) and that s(zx) is attained 
by s(N) for some neighborhood N of x. It is clear then that for every x in X 
there is a neighborhood No of x such that if N is any neighborhood of x which is 
contained in Ny we have s(x) = s(N), c(z) = c(N). We shall agree to write 
s(t) < e(x) for every xz with c(x) = u. 

TuroreM 5.3: If X is a metric space which is locally complete and locally 
separable then s(x) < c(x) for every x in X. 

This is nothing more than the well known category theorem of Baire applied 
to the case in hand. 

Unfortunately the locally separable spaces are, as far as we know, the only 
ones which satisfy the condition s(x) < c(x), and so what we have to say is 
essentially applicable only to these spaces. The range space X; , however, will 
be left quite arbitrary. A space is said to satisfy the condition (H) in case 
(H) for every pair x, x’ of points with x ~ x’ there are neighborhoods N, N’ such 
that xe N, x’ NN’ = 0. 

TuEeorEM 5.4: Let X be an arbitrary” complete metric space and G an open 
sel of X. Let f(x) be a continuous function on X to X, where the space X, satisfies 
the conditions A, B, C’ and H. Then 


€ 


If s(@) is finite G consists of a finite number of isolated points and since the 
theorem is easily verified in this case we shall assume that s(@) = No. 


” Le., we do not assume local separability or s(z) < ¢(z). 
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Let G be an open set and suppose that the sets Ga in the series (a) form a I 
basis for G. Let H be an open set such that H C G and define H, = HG, al 
so that the sequence tr 


(b) H, , Hs, a <s'(G) ( 


forms a basis for H. For certain points x in X there will be at least one de- 


numerable sequence ¢, of ordinals < s’(@) for which T 
(c) zeH,, A;, C S.(z, 1/n), n = 1, 2, 
where S.(x, 1/n) is the closed sphere with center z and radius 1/n (i.e., the ( 
set of all yin X such that (x, y) S$ 1/n; this is not necessarily the closure of the 
set S(x, 1/n) of y for which (x, y) < 1/n). For example if x is a point of H, i 
; there is for each n = 1, 2, --- a sphere S(x, 2r,) C HS.(a, 1/n) and since (b) if 
i is a basis for H there is an ordinal ¢, < s’(@) such that x e H;, C S(z, r,) and so e 
xeH;, rn) S(a, 2rn) C S.(2, 1/n). 
i There may be points besides those in H for which such sequences exist. Let H* 
FH i be the set of all points x in X for which at least one sequence satisfying (c) exists. 
i] Then, obviously, 
| (d) HCH*CH CG. 
i ‘i i li Let Z be the space, discussed in §4, of all denumerable sequences {f,,} of 
VAM ot ordinals ¢, < s’(G). For every x in H* we define Z(x) as the family of all 
it {fn} Z for which (c) holds. Since for ; = {f,} €Z(x) we have 
diameter [] ;, < diameter A;,, < diameter S.(x, 1/n) < 2/m, 
n=1 
the set [[%_. H;, reduces to the single point z. This shows that for every x 
and y in H*, with  ¥ y we have Z(x)Z(y) = 0. It will now be shown that P 
the set 
(e) Z(H*) = Z(2) 
ze H* 
is a closed set in Z. Suppose 3, = {>} is a sequence of points in Z(H*), with 
jn — 30 = {£3}. From the definition of the metric in Z one can find a sequence ¢ 
m, — © such that 
Since 2, y e H*, x ¥ y implies Z(xz)Z(y) = 0 we have z, in H* uniquely deter- ] 
mined by the condition 3, € Z(2,), and thus ‘ 
(g) tne Hy S.(an, 1/p) p = t 
Thus by (f) we have 
(h) tne C 1/p) ?= 1,2, +++, Mn. 
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It is obviously no loss of generality to assume that the sequence m, is monotone 
and thus from (h) we have for r S n the set Witte Aye containing both z, and 
z,, and so for r S 2, it follows also from (h) that 


(z,, tn) diameter Il < diameter A diameter S.(z,, 1/m,) 2/m,r. 
p=1 


This shows that the sequence {z,} is a Cauchy sequence. Let 2% = lim z,. 
All that remains to be shown is that 2 « H* and that 2 «¢Z(x). These two 
statements can be verified simultaneously by showing that 

(i) , 1/p) p = 1,2,-:- 


For a fixed p we have m, > p for sufficiently large n, and thus (by (h)) 2, ¢ A;2 
if n is large. This shows that a ¢ H;0 for every integer p. Also by (h) for 
every y € Aye, (y, tn) S$ 1/p if m, = p and thus (y, %) <1/p. This establishes 
(i) and completes the proof that Z(H%*) is a closed set. 

For every integer n = 1, 2, --- and every 3 = {f>} in the closed set Z(H*) put 


= IL S(H;,) 
p=1 
so that 


Suppose 3 corresponds to x ¢ H*, i.e., 3 ¢ Z(x), then using (c) and the continuity 
of f we have 


f(z) ef(A;,) C f(H;,) 1, 2, 
and thus f(z) « [[%.4f(H;,). Let let m = f(x) and suppose that there is another 


point 2; * 2, also in []%,f(H (H;,). It is readily seen from the condition H 


which was assumed to hold in X, that there is a neighborhood Ni containing ry 


and such that z, —e Nj. Since 2; ef(H;,) for every n we have N if(H;,) # 0 
and hence points x, for which 


f(an) «Ni, €H;,, 4, 
Since 
tn eH;, C Ay, 1/n) n= 1,2,.--- 


the point z, > x. Since — N{ there is a neighborhood N, of 2; such that 
N,N; = 0 and by continuity we must have f(z,) ¢ N; for all wullicionthy large n 


Which is a contradiction to the fact that «Ni. Thus t,) reduces 
to the single point f(z) and thus 


S(H*) = II K;,...:, = Aza [K], summed for Z(H*). 
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It is readily shown that f(H;,) is the complement of an open set and thus by 
Theorem 3.4 the kernel K,y,...;, which is defined over Z(H*) has its values 
in Bc). By Theorem 4.1 the set f(H*) is a Be) set. To each z in G assign 
) an integer n(x) such that S(z, 2/n(x)) C G, and let 
Hn = S(z, 1/n(2)), 


where the sum is taken over all z in G for which n(x) = m. Let x ¢ H» and 
Zp —> 2%. Then for each p there is an x” with 


n(x?) =m, 2peS(x?, 1/m). 

For sufficiently large p, (%», %) < 1/2m and thus for large p 
(x”, S (x", Lp) + (Xp, Xo) < 1/m + 1/2m < 2/m, 

so that 

| x € S(x”, 2/m) = S(x”, 2/n(x”)) CG. 

Thus H, C G, and using (d), 


Hn CAn CG. 
From the definition of H» we see that = = Gand thus 

| / i | {i is, by Theorem 3.4, a 8.) set. This completes the proof of Theorem 5.4. 6. 
i) ey ia l Now let x, = f(x) be a continuous mapping of one topological space X onto 5 
on he Ty another space X, and suppose that in both spaces EF = E. Suppose f is such “ 
Pease! a that for every neighborhood N the set f(N) is not a non-dense set. Then « 
since f(Z) C f(E) we see that E must be non-dense if f(Z) is. Likewise if (i 
S(E) ie., M 
f(E) = >> EX, EX non-dense, (i 


= 
we may take E* C Eso that f(E*) = Ef and E = E*. Since f(E’) = fi A 
is non-dense so is H* and thus E eI,. In other words, for continuous functions b 


which take neighborhoods into sets which are not non-dense we must have : 
t 

S(N) ITs @ < c(N). th 

As pointed out above there is for every x a neighborhood N, containing 2 lo 
such that for every neighborhood N of x with N C Np we have s(x) = 3(N), (i 


c(z) = c(N). Fora fixed z let 2, be the set of all N in N for which z eN CNo, - 
and let 2’ = %.. The family then determines (assuming A(ii)’) the 
same topology (i.e., the same notion of open set) as that determined by the 
larger family 9%. We can say then that every N eX’ contains at least one Co 
point x for which s(x) = s(N), c(x) = c(N). These equalities of course do not 
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have to hold for all points in N and thus it is not true that for every neighbor- 
hood N eX’ of a point x we have s(x) = s(N), c(x) = c(N). Suppose the 
locally complete metric space X has the property s(x) < c(x) and that the 
family It (i.e., the set of open spheres) has been replaced by the set of N’ € 2’ 
for which N’ is complete. Let f(x) be a continuous function on X having the 
property that open sets transform into sets which are not non-dense. Then 
for every N’ e Qt’ there is at least one point z« N’ such that s(N’) = s(x) < 
c(t) = c(N’) and so by the preceding remarks we have f(N’) ¢ IZ). But 
by Theorem 5.4, when proper restrictions are placed on X, , we have f(N’) € Baw) 
so that 


SN) C By 


The following theorem then follows immediately from Theorems 5.2, 5.3, 5.4. 

TueoreM 5.5: Let X be a locally complete and locally separable metric space, 
and let X, be a Hausdorff space." Let f(x) be a continuous distributive function 
which maps X into the entire space X, in such a way that non-void open sets in X 
are transformed into sets which are not non-dense.” Then for every non-void open 
set Gin X we have f(G) = G, + E,, where G, ts a non-void open set in X, and 
E, is a non-dense set which is closed in f(G). 


6. Locally homogeneous spaces 


In this section we shall make an application of the results of §5 to a situation 
somewhat more general than an additive function on a group. 
6.1. Preliminary lemmas. All of the spaces considered in §6 are neighborhood 
spaces in the sense of Hausdorff, i.e., we have a family 9t of non-void sets N, 
each of which is called a neighborhood of any one of its points, and the following 
assumptions are made. 
(i) IfN.,N ; are neighborhoods of z then there is a neighborhood M, of x with 
M. CN.N,. 
(i) If ¥ y there are neighborhoods N, , Ny of x and y respectively with N,N, 
= 0. 
Any sum of neighborhoods is called an open set and the family of open sets will 
be denoted by @. The closure £ of a set E is the set of all x for which N,E ¥ 0 
for every neighborhood N, of z. When we are dealing with two spaces X and X, 
the neighborhoods and open sets in X, will be distinguished from those in X by 
the subscript 1. Besides being a Hausdorff space we assume that the space X is 
locally homogeneous in the following sense. 
(ili) § is a family of homeomorphisms y = h(x). Each of these homeomorph- 


" Le, a space satisfying A(o), A(i), A(ii)’, B’ and H. 

® This last assumption and the assumption of local separability (or s(x) < c(z)) may of 
Course both be replaced by the single assumption that for every neighborhood N of a point x 
we have S(N) € IT .¢2) 
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isms h e § has for its domain of definition D(h), an open subset of X. The 
range R(h), i.e., the set of functional values of h is also an open subset of X. 

(iv) The identity function x = h(x) defined throughout X is an element of §. 
(v) For every pair a, b of points in X there is one and only one hw « having 
the properties a D(hes), beR(ha), 6 = haw(a). We shall sometimes write 
(a, b; x) for h(x). 

(vi) The function (2, y; z) is continuous in its arguments; and by this we mean 
that if z ¢ D(h.,) and N is a neighborhood of (z, y; z) then there are neighbor- 
hoods N,, N,, Nz of x, y, 2 respectively such that if x’ N,, y’ «Ny, 2’ 
have 2’ D(hzy’) and (2’, y’; 2’) eN,i.e., (Nz, Ny;Nz) CN. 

It is clear that Hausdorff linear spaces, groups, or fields of point sets are locally 
homogeneous spaces as defined above. For linear spaces the family may be 
taken as all functions of the form a + 2; for groups all functions of the form az 
and for fields of point sets all functions of the form a + x — az. In each of 
these examples we have D(h) = R(h) = X for every he. Perhaps the 
simplest example of a locally homogeneous space for which D(h) # X is the semi- 
group X composed of all real positive z, where the family is taken as all func- 
tions of the form h(x) = a + x where ais positive or negative. Here the domain 
D(h) is the set of all positive z > —aand the range R(h) is the set of all positive 
x > a. It is desirable to consider such examples as the last one mentioned 
from an abstract point of view for there are many places in analysis where a 
homomorphic mapping from such a semi-group onto the ring of bounded linear 
operators on a Banach space presents itself and in such a way that it cannot be 
extended to the whole group.” 

The case of the positive real axis is an instance of the following general type 
of locally homogeneous space. Let G be a Hausdorff group (written additively) 
and X an open subset of G (perhaps not containing the identity and thus not a 
sub-group) such that a + b e X whenever a,b eX. Let the neighborhoods in X 
be those neighborhoods in G which are contained in X. Suppose that X is such 
that (a + X)X ¥ OforeveryainG. Let § be the family of all functions of the 
form h(x) = a+xwhereaeG. The domain D(h) is then X(—a + X) and the 
range R(h) = a + X(—a + X). 

Lemma 1: The space X is regular.” 

Let z be a point of the neighborhood N. Since x = (zx, x; x) we have in view of 
(i) and (vi) a neighborhood M of x such that (M,M;M) CN. Let z bea point 
of M. In view of (iv) and (v) hzz = Azz and thus x = (z,2;2) ¢«M. There will 
then be a neighborhood N, of z such that (N., N.;2) CM. Since z e M there 


18 The reader is referred in this connection to several papers of E. Hille. For example, 
Notes on linear transformations. I, Trans. Amer. Math. Soc., 39(1936), pp. 131-153; On semi- 
groups of transformations in Hilbert space, Proc. Nat. Acad. Sci., 24(1938), 159-161 ; Notes on 
linear transformations. II, Annals of Math. (2) 40(1939), pp. 147. 

4 Le., for every neighborhood N of a point x there is a neighborhood M of z such that 
Mc N. See A. Kolmogoroff, Studia Math., vol. 5(1934), p. 31, where this property is 
proved for groups. 
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isa point ye MN,. Since z and y are both in N, we have (y, z;z)«M. Now 
if u = (y, 2; x) we have by (v) hy: = he and thus z = (y, 2; y) = (x, (y, 2; x); y) 
eN. 

Lemma 2: For any pair x, 2%» there is a neighborhood N, of x such that N, C 

D(he'z,) for every x’ Nz. 

From (ii) we see that there is wm" least one neighborhood of every point. The 
desired conclusion follows immediately from (i) and (vi). 

Lemma 3: If N and N’ are neighborhoods of a point x and tf xo is a point of N 
then there is a neighborhood N"’ of the point x with N’’ < NN’ and such that h(N"’) 
CN for everyh with N” C D(h) and x h(N”’). 

Since % = (x, % ; 2) € N there is by (vi) a neighborhood N” of x with N”’ C 
NN’ such that N” C D(h,z,) for every x’ « N” and such that (N”’, x ;N”’) CN. 
Suppose ha = h issuch that N” C D(h), €h(N’’). Then for some zx’ 
we have % = (a, b; x’) and thus by (v) we have h = ha = hz’, and so h(x’’) = 
(2’, ; 2") forevery Thush(N”) CN. 

A function f defined on X and with values in an arbitrary Hausdorff space 
X, is said to be §-distributive in case whenever E C X is such that f(E) is a 
neighborhood in X, then f(h(#)) is open for every he § with E C D(h). An 
additive (not necessarily continuous) operation between linear spaces is an 
example of such a distributive function. 

Lemma 4: Let X, be a Hausdorff space and f continuous and $-distributive from 
X toX,. Then if for every neighborhood N in X, f(N) has an interior point, the 
function f will be an interior transformation, i.e., f transforms open sets into open 
sets. 

Let N be a neighborhood of 2). We shall show that f(N) contains a neighbor- 
hood of x; = f(a). Let N,, HC N be such that f(N) DN, = f(£). Fix zin 
E and, using continuity together with Lemma 2, pick a neighborhood N’ of x 
such that f(N’) CN, and N’ C D(hzz,) for every point x’ of N’. Using Lemma 
3 choose N’’ C NN’ with z in N” and such that h(N’’) C N for every he S 
with D(h) and eh(N”). Pick Ni so that f(N”) D Ni, and 2” «N”, 
so that 21 = f(x"). Leth = , then since x” N’’ C N’ we have 
N" C D(h) and a h(N’’) and thus h(N’”’) CN. Now there is a set CN” 
such that x” f(E’) = Nj and thus, since f is $-distributive, f(h(E’’)) is 
open. Hence we have 2 = f(x) = f(h(2")) ef(h(E")) f(N). 
Since f(h(E’”’)) is open f(N) contains a neighborhood of 2; . 

6.2. The equivalence of comparable topologies in X. It is known that if a linear 
space is of type (F) under each of two distances and if convergence in one of the 
two metrics implies convergence in the other then the two metrics are equivalent 
and define the same topology.” This, of course, is equivalent to saying that a 
1-1 continuous linear map of one (F) space onto the whole of another is neces- 
sarily a homeomorphism. As mentioned in the introduction the same theorem 
has been proved for groups but under the assumption that each of the distance 


“8. Banach, Théorie des opérations linéaries, Warsaw 1932, p. 41, Theorem 5. 
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functions makes the group complete, metric, and separable. If (x, y) and 


il ) (x, y): are two distance functions defined on the same set of points then the 
) statement that (2, , z) — 0 implies (x, , 21) — 0 is equivalent to the statement f 
| that @, C G where G, G, are the families of open sets determined by the distance 
| functions (x, y), (x, y): respectively. It is this latter form, i.e., G; C G, that ( 

we prefer to use in stating the assumption regarding the comparison of the two g 


) topologies. We shall state here two additional assumptions either one of which 
| when combined with @, C © will yield the desired conclusion, i.e., ©, = G. 
(H;) X is a locally complete and locally separable metric space and 9t is the family 
of open spheres in X. The family 9, is a family of subsets of X forming a 
Hausdorff space (i.e., Jt; satisfies (i), (ii)). For every N eM we have N not non- 
dense in the Nt, topology. 

i (Hz) In this assumption X remains general (i.e., is not necessarily metric) and 
N,isasin H,. The assumption Hz is then N C The families , 
i II}, are the Baire sets and second category sets resp. (with respect to the cardinal 

¢) in the J, topology. 

In §5 it is shown that H; implies H,. It is obvious that H2 does not imply 
| H, so that H is the more general assumption but has the disadvantage that it is 
| not as easily applied as H,. Incase X is a complete metric and separable group 
H | and a new topology N, C G is introduced which makes X a Hausdorff group of 

| second category in itself then it is clear that H, and He are both satisfied. 

. THEOREM 6.1: Suppose that a new topology defined by a new neighborhood family 
/ Ni C G has been introduced in X forming a Hausdorff space X, which is also locally 
homogeneous with the same family $. Then either H, or Hz will imply the equality 
i G, = G. 
i The identity map x = f(x) on X to X; is H-distributive, for if N; € It, then 
h(N,) Furthermore since Jt; C the function f is continuous. Since f 
is 1-1 we have, by Lemma 1 above and Theorems 5.1 and 5.5, an interior point 
in f(N) for every N in N. Thus by Lemma 4, N = f(N) €@: and so © C 
©, CG. 

6.3. Additive mappings between groups. A mapping x, = f(x) of one group X 
onto another X,; (both groups written additively) is called additive if f(x + y) = 
f(x) + fly). Such a function is necessarily §-distributive in the sense used 
above. 

THEOREM 6.2: Let X; be a Hausdorff group and X a locally complete and locally 
separable metric group with a one-sided invariant metric.'® Let f be a continuous 
additive function with domain X and range X, which takes spheres into sets which 
are not non-dense.” Then f is an interior transformation, i.e., f takes open sets 
into open sets. 


16 For example (xz — y, 0) = (x, y). In view of the result of Kakutani, Proc. Imp. Acad. 
Jap., 12(1936), pp. 82-84, this hypothesis may be dispensed with. See also G. Birkhoff, 
Compositio Math., 3(1936), pp. 427-430. Birkhoff does not state explicitly the invariant 
property of the metric but it can be easily obtained from his work. 

17 This property is automatically satisfied if X is separable and X, is of second category 
in itself. 
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ind The one-sided invariant metric shows us that f is distributive in the sense in 
the which that term is used in §5. The present theorem then follows immediately 

ont from Theorem 5.5 together with Lemma 4 above. i 
nce Finally it might be mentioned that Lemma 4 enables one to state immediately 

rat (i.e., without recourse to §5) a theorem of the above type for locally compact 

wo groups in which case the whole situation is much simpler. 
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ON CONTINUOUS MAPPINGS OF MANIFOLDS INTO SPHERES 


By EILENBERG 
) (Received June 19, 1939) 
Introduction’ 


I. We shall consider in this paper continuous mappings of an (n + 1)-dimen- 
sional finite and oriented manifold’ M"*" into the (m + 1)-dimensional spherical 
manifold S”*’. 
Let p and q be the poles of A mapf(M"*") will be called regular 

if the sets ‘(p) and are (n — m)-dimensional polyhedra.’ 

i (A) Each mapping f(M"*') C S”"** is homotopic to a regular mapping g(M""') 

For each regular map f(M"*") C S"*" two (n — m)-dimensional cycles with 
integer coefficients: "(f) inf ‘(p) and "(f) in f ‘(q), will be defined so as 
to satisfy the following conditions: 


(C) If g is a regular map homotopic to f, then "(g) ~ “(f). 

4 (D) Given a cycle such that ~ "(f) in — f-'(q), there is an ( 
Be, (n — m — 1)-dimensional subpolyhedron P”—" of M"** and a map g homotopic s 
to f such that! 
i g =f '(@); va "(9) = va “(f). 
‘ |e ee Denoting by I'(f) the element of the (n — m)-th homology group of M"" . 

determined by y;, "(f), we deduce from (A) and (C) that I'(f) is a homotopy Q 
a invariant which may be regarded as defined for any map f(M"*") C 8” not 


necessarily regular. 
II. Let n = 2m, and let f(M°"**) C S”*" bea regular map such that I'(f) = 0. 
We have then ~ 0 and by (B) also "(f) ~ 0. The linkage coefficient 


fo 
is therefore well determined. Ir 
1 The paper is arranged so that all the main results are stated in the introduction which (E 
is divided into five parts. In five sections which follow the introduction, the corresponding sor 
proofs are given. 
* See K. Rerpemerster, Topologie de Polyeder, Leipzig 1938, p. 151. 
| 3 With respect to a certain subdivision of M"*!, The term k-dimensional when applied 

i to subpolyhedra of M**! will always mean at most k-dimensional. ca 
j 4|y"-™ | is the smallest (n — m)-dimensional polyhedron containing 7". 38 
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(E) If g is a regular map homotopic to f, then c(g) = c(f). 
(F) If m is even, c(f) = 0. 

It follows from (A), (C) and (E) that c(f) may be regarded as a homotopy 
invariant defined for any map f(M*"*") C S”*? such that I'(f) = 0. 

The invariants I'(f) and c(f) have been introduced and investigated by Hopf.’ 

Ill. The property (D) announced above is essentially new and will make 
available a closer discussion of the mappings f(M"*') C S"*' for m <n S 2m. 

Turorem I. If m < n < 2m then for a given mapping f(M"*') C S”** the 
following properties are equivalent: 

(i) T(f) = 0; 

(ii) there is a mapping g homotopic to f such that g'(p) and g‘(q) are 
(n — m — 1)-dimensional polyhedra; 

(iii) there is a mapping g homotopic to f such that g ‘(p) is a (n — m — 1)-dimen- 
sional polyhedron. 

THEOREM II. Given a mapping C S™*"(m > 0) the following proper- 
ties are equivalent: 

(i) e(f) = 0;° | 
(ii) there is a mapping g homotopic to f such that g'(p) and g™‘(q) are (m — 1)- 
dimensional polyhedra; 
(iii) there is a mapping g homotopic to f such that g ‘(p) is a (m — 1)-dimensional 
polyhedron. 

Let a’”** be a simplex of M’”*? and let C7'*’ and Cz'*" be two oriented convex 
(m + 1)-dimensional simplicial cells inside a’”*’. The oriented boundary 
spheres will be denoted by si' and sz , the corresponding cycles by of and o2'. 
If ss’ = and if v[oy , = 1, we shall say that and s2' form an elementary 
couple. 

TuEorEM III. Consider an elementary couple sj, 8: in M°’"*'(m > 0) and 
two integers a, 0, a2 0. Given a mapping C S™** we have e(f) = 
aay ,” if, and only if, there are two (m — 1)-dimensional polyhedra P™™* and 
Q”" and a mapping g homotopic to f such that 


= st = mor, 


IV. The conditions in Th. I-III can be simplified if M"™’ is subject to the 
following homology condition :’ 


(H) B‘(M"**) coef = 0 fori = 1,2,---,n —m-— 1. 
In the case n = 2m condition (H) takes the following form: 
(H’) B‘(M?"**) coef (m?"~‘) = Ofori = 1,2,---,m— 1. 


°H. Horr, Math. Ann. 104 (1931), pp. 637-665, and Fund. Math. 25 (1935), pp. 427-440. 

‘This includes = 0. 

” (m*) stands for the k-th homotopy group of S". The group is abelian and therefore 
can be taken as a coefficient group for B‘(M"*1), See W. Hurewicz, Proc. Akad. Amsterdam 
38 (1935), p. 113. 


| 

| 

cal 

lar 

+1 

) 

ith 

as 

4 

i] 

= 

| 

ntl if 

py 

10t al | 

| 

ich 

ied 


664 SAMUEL EILENBERG 


Assuming that M”™ satisfies (H) we restate our theorems as follows: 4 
TuroreM Ia. If m < n < 2m then for a given mapping f(M"*') 5 
the following properties are equivalent: 
(i) T(f) = 0; fe 
(ii) there is a mapping g homotopic to f such that g '(p) and g™*(q) are single ii 
points ; li 
(iii) there is a mapping g homotopic to f such that g"‘(p) is a single point. 
TuEorEM IIa. Given a mapping f(M’""') S""(m > 0) the following 
i | properties are equivalent: 
(i) e(f) = 0F i 
(ii) there is a mapping g homotopic to f such that g‘(p) and g‘(q) ‘are single 
points ; 


I (iii) there is a mapping g homotopic to f such that g ‘(p) is a single point. 
if TurorEM IIIa. Consider an elementary couple sj, sz in M°’"** (m > 0) and 


fi two integers a, ~ 0, a2 ~ 0. Given a mapping f(M°"™"') C S”** we have c(f) = te 
if ayae’ if and only if there is a mapping g homotopic to f such that a 
| g = sf Y>(9) = aor T 
=, 2G) = ae? . 
He hil i The condition (H) is always satisfied if m = n + 1. It follows that Th. Ia p 
can always be applied to mappings f(M"*?) S"\(m > 1) and that Th. IIa 
Me and IIIa hold for mappings f(M*) C 8°. is 
Og The groups (m") for k > m are generally unknown. They have been calcu- 
lated® in the cases k = m + landk = m+ 2. tl 
wit ay Using these results we verify that: be 
Th. IIa can be applied to mappings f(M°) C provided B'(M*) = 0 (coef. 
integers). 
Th. Ia can be applied to mappings f(M"**) C S”"**(m = 3) provided B'(M"*’) he 
= 0 (coef. mod 2). 
Th. Ila and IIIa can be applied to mappings f(M") C S* provided B’ (M’) is 
= 0 (coef. mod 2). 
Th. Ia can be applied to mappings f(M""*) C S"*!(m = 4) provided B’(M"”*) m 
= 0 (coef. mod 2). Us 
V. The most important case when (H) is always satisfied is when M"" = 
S""*. It follows that Th. Ia-IIIa can then always be applied. Using certain of 


operations introduced by Freudenthal” and Hopf" I shall be able to restate 
these theorems in a more geometrical way. 

Let S"** = E?*' + E™* be a decomposition of S”*' into two hemispheres 
such that: S" = E?*'.E™* is the equator of S”*', p is the “center” of ET” 


8 L. Ponrrsacin, C. R. Acad. Sci. URSS 19 (1938), pp. 147-149, 361-363. 

4 ® Theorem IIIa has no application in this case because of (F). 

10H. FREUDENTHAL, Compositio Math. 5 (1937), pp. 299-314. 
1H. Horr, Fund. Math. 25 (1935), pp. 427-440. , 


n+l 
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and q is the center of E™"’. Similarly we have a division S"*' = E}** + E*” 
such that S" = E7*’.E"™. Let p’ and q’ be the “centers” of E}*' and E%*'. 

Given a mapping ¢(S") C S” we define a mapping F*(S"*’) C S”*" by the 
following condition: for each point x ¢S" the radius p’,x of E}*’ is mapped 
linearly onto the radius p, (x) of E"' and the radius q’,z of E"** is mapped 
linearly onto the radius g, g(x) of E™**. 

The operation F* was established by Freudenthal"’ who also proved the fol- 
lowing two theorems: 

TueorEeM Ib. If m < n < 2m then for every mapping f(S"*') C S”*" there 
is a mapping g(S") C S”™ such that F* is homotopic to f. 

TuEoREM IIb. Given a mapping f(S’"*') C S"*"(m > 0) we have c(f) = 0 
if and only if there is a mapping ¢(S’") C S” such that F* is homotopic to f. 

Note that according to (F) we have c(f) = 0 if m is even. 

A mapping g(S” X S”) C S” of the topological product S” X S” is said” 
to be of the type (a1, ae) if for a given t eS”, S” X t is mapped with the degree 
a, andt X S” is mapped with the degree az . 

Let E”*" be the full sphere bounded by S”, and let o be the center of E”*’. 
The boundary E”** x + S” X of the product X E”*’ may be 
regarded as identical with S°"*". Given a map ¢(S" X S”) C S” we define a 
map x + x E”*') by the following condition: for each 
point (2 , 22) S” X S” the radius (0, 22), (a1 , 2) of X is mapped lin- 
early onto the radius p, 9(21, 22) of and the radius (21, 0), (x1, 22) of X 
is mapped linearly onto the radius qg, 22) of 

The operation H* has been introduced by Hopf" who also proved the theorem 
that if g is of the type (a1 , az) then c(H*) = aaz. The following theorem may 
be considered as its inverse. 

THEOREM IIIb. Given a mapping f(S°’"*') C S"*"(m > 0) such that c(f) = 
a-a2 ~ 0 there is a mapping g(S” X S”) CS” of the type (a1, a2) such that H* is 
homotopic to f. 

For even m it was proved by Hopf™ that each mapping g(S” X S”) C S” 
is either of the type (a: , 0) or of the type (0, az). 

On the other hand Freudenthal™ has recently announced a theorem that if 
m is odd and & is an integer, there is a map f(S’""’) C S”"’ with e(f) = k. 
Using Th. IIIa we obtain that 

If mis odd and ay , a2 are any two integers, there is a mapping o(S” X S”) CS” 
of the type (a , az). 

This solves a problem proposed by Hopf.” 


I. Definition and Properties of y> "(f) 


1. Proof of (A). It is well known that there is a simplicial map 
9(M"") < S"* homotopic to f. The simplicial division of S”** may be chosen 


H. Horr, loe cit., p. 431. 

* Ibid., p. 486. See also H. Wurrney, Ann. of Math. 39 (1938), p. 428. 
“H. Frevpenrnat, Proc. Akad. Amsterdam 42 (1939), p. 140. 

Ibid., p. 436, 
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so that p and q are inside points of some (m + 1)-dimensional simplexes. It is 
easy to verify that g is regular. 


2. Let f(M’™’) C S”*' bea regular map. We consider M"*" in a subdivision 
such that and f are subcomplexes. To this simplicial division there 
corresponds a dual cell division so that to each k-simplex a‘ in M"* there cor- 
responds a (n + 1 — k)-cell b"*?™. 

Let G be an arbitrary abelian group. Given a k-chain A* = >} a;a! with 
integer coefficients, and a (n + 1 — k)-chain B"*** = >> g,b?*** with coeffi- 
cients in G, we shall consider the intersection coefficient (which is an element 
of @) x[A*, = afi. 

For each (m + 1)-dimensional chain = 6,b7*' we have 
— f‘\(p) and therefore C — p. It follows that 
the local degree in the point p, g[f, B”""’, p] «G, is defined. If B”*’ is a cycle 


then the degree g[f, is defined and g[f, = g[f, pl. 


3. Definition of y;"(f).. Let a?~” be a simplex of f-'(p). We consider the 
integer a; = g[f, 67°", p] and the (n — m)-chain y>"(f) = aa?” inf “(p). 
Since = a: = glf, p] for each dual cell therefore 


(1) "(f), = pl for each chain = 
and 
(2) xly> "(f), = of is a cycle. 


Let b"** be a dual (m + 2)-cell. 9b”** is then a (m + 1)-cycle and 
g[f, ab"**] = since ~ 0. By (2) we have then x[y* "(f), ab”*’] = 0 
and therefore x[ay> "(f), b”*?] = 0. Since this holds for each (m + 2)-cell 
b”*? therefore dy, "(f) = 0 and "(f) is a (n — m)-cyele. 

Similarly the cycle y7~"(f) in f-'(q) is defined, and (1) and (2) hold. 


4. Proof of (B). Applying (2) to y> "(f) and vy; "(f) we find that 
= x62-"(f), for each cycle This implies” 
(f)~v “(f). 


5. Proof of (C). For each cycle B”*' we have g[g, B”*"] = g[f, There- 
fore by (2) "(g), = xb> "(f), This implies” y>""(g) ~ 
Y> "(f). 


6. We consider S”*' as the set of points = t2, Of 
the euclidean (m + 2)-space, defined by the condition xj + 2 +--+ 
+ = 1. Let us put U(x) = forz e The point p is then 
defined by the condition: /(p) = 1, the point q by l(q) = —1, the equator s” 
by the condition I(x) = 0. 


16 By Poincaré’s duality theorem. 


+} 
i | 
‘ 
i 
7 
; 
| 
i 
ayn 
j 
& 


It is 


sion 
here 
cor- 


vith 
effi- 
lent 


ave 
that 
ycle 


the 
(p). 


fore 


ON CONTINUOUS MAPPINGS OF MANIFOLDS INTO SPHERES 667 


For each point « « S""' — p — q we define 
r(z) = pets 9) 
This way we obtain a mapping r(S"** — p — gq) C S” which is a “radial’”’ 
y 
projection. 


Given a point y eS” and a number —1 < 1 < 1 there is a point x = 
ly, L} € S”** — p — q uniquely defined, such that r(x) = y, U(x) = 1. 


1. Proof of (D). Let V be the subcomplex of M"** consisting of all the 
simplices of M"*' which have no vertex on f ‘(q). Let U be an open set such ; 
that CU CU CM™ — V. Let P™™ =p), X = — U, 
Z = U-X, and g(x) = r[f(z)] fora eX — We obtain this way a map 

Let be a (n — m)-simplex of and its dual cell. Since 
V CX — Z therefore gl¢, = p] and according to the definition 
of "(f) we then have y; "(f) = ale, ab?7Ja?~". It follows that 


“"(f) = 7” "(v) where y” "(¢) is a (n — m)-cycle attached to the map 
which I have defined in another paper.” 

The relation y” ~ y> "(f) in implies ~ "(f) i in 
VCX — Zand therefore y"" ~ y" "(v) in X — Z. Therefore by a lemma”™® 
concerning the cycle y” ”(¢) Aidy is a (n — m — 1)-dimensional polyhedron 
CX — Z and a map — |y" "| — C S™ such that 
= 7" Wa) = if x eZ. 

Let k(x) be a continuous function defined for «M ke) 1, 
and such that ; | 


k(x) = —1 ifandonlyif z 
k(x) = 1 ifandonlyif rely" "| + 
k(x) = U[f(z)] if rel. 
We define the map g(M"*") C S”"* by taking 
g(x) = f(z) for 
g(x) = {y(x), k(z)} for —U—|y"™|—- 
g(x) =p for rely” 


The condition g“(p) = |y"""| + and = are obviously 
Since r[g(xz)] = (x) for each x — therefore y; "(g) = 
= Since = f(x) for U = CU therefore 


= ™(f) and there is a neighborhood U, of in such that 


'T Fund. Math. 31 (1938), p. 183. 
Thid., p. 198. 
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g(t) =f (x) if xeU,. The existence of such an implies easily that g 
and f are homotopic. 
Note that (C) and (D) can be restated replacing p by g and vice versa. | 


8. It follows from Poincare’s duality theorem that the homology class I'(f) 
of y; ”(f) is defined by the property (2). In particular ['(f) = 0 is character- 
ized by the condition g[f, B”*’] = 0 for each cycle B"™. This means that i 
I(f) = 0 if and only if the map f(M""’) C S”” is algebraically inessential. 

If n= mtheny; ”(f) is a 0-dimensional cycle of I'(f) which can be considered 
as an integer. It follows from (2) that T(f) = glf, M -= : 


Il. Properties of c(f) 

1. Proof of (E). Let C be the subset of S”*’ defined by the condition 
—} Let 7 >0 bea number having the following property: if x eC, ( 
yeS”*' and |z — y| < nthenyeS”” — p — qand | r(z) — r(y) | < 2. 
i Given two regular and homotopic maps f and g there is a finite sequence of t 
maps fo = f, fi, fo, fx = g such that | fi(x) — < nif 
and i = 0, 1,---,4 — 1. By (A) we may admit that all these maps are 


. | i regular. The proof of (E) reduces therefore to the case when t 
Let = R, + Re be a decomposition of into two polyhedra such 

that f"(p) C Ri — C Re — Ri, f(Ri-R2) CC. Writing = 
and ¥(r) = r{g(z)] for Ri- Re, we have by definition of 7, | — (x) | < 2 f 
if It follows that and y are homotopic and therefore that f 
ale, B”] = gly, B”] for each cycle B” C In particular if is a 

chain in R, such that 2B”** C R,- we have gly, 2B”*"] = gly, This 
implies g[f, B"*’, p] = glg, p] and by (1) = xly>(g), s 
By duality theorems” we then have yp(f) ~ yp(g) in Ri — Re. Similarly q 

vi(f) ~ vr (g) in Re — R, and therefore c(f) = c(g). 

i 
2. Proof of (F). Let h(S”"*’) = S8”** be a homeomorphic map of degree 1 s 
such that h(p) = g and h(q) = p. Let g(x) = Alf(x)]. As it easily follows I 

from the definition, we have y;(g) = v7(f), = y>(f). Since f and g are 

homotopic we have c(f) = c(g), and therefore c(f) = vly>(f), va(f)] = 
However, if m is even = xland 
thus e(f) = 0. ( 

III. The Case m < n S 2m 

1. We shall often use the following elementary t 
Lemma 1. Let R be a subpolyhedron of M"* and R" a k-dimensional sub- is 
polyhedron of Given a ke-cycle — R — ky + ke <n, then t! 


~ 0 in — R implies ~ 0 in — R — R®. “ 


; be 19 See L. Pontrsacin, Math. Ann. 105 (1931), p. 190; also Ann. of Math. 35 (1934), p. 312. 
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2, We shall consider m-cycles y" in M’"*" having the following property 
(b) each m-simplex of | y" | has in y™ a coefficient +1; for each cycle y~ in | y™ | 
with coefficients in a group G there is an element a €G such that yi = ay”. 

Using duality theorems” we obtain the following 

Lemma 2. Let y" C M°™" be a cycle having the property (b) and such that 
y"~ 0. Given a cycle yi — |y"| (coefficients in G) such that ~ 0, 
we have yi ~ O in — | y™ | of and only if oly", yr] = 0. 


3. Lemma 3. Given a mapping f(M’"*') C S”* such that '(f) = 0, there 
is a (m — 1)-dimensional subpolyhedron P”~' of M°"*' and a regular mapping 
homotopic to f such that y>(g) has the property (b) and g'(p) = | ¥p(g) | + 

Proor. Let g be a normal map homotopic to f. Since ['(g) = I'(f) = 0 
therefore y7(g) ~ 0. By a general theorem” there is a cycle y" C M*™** — 
g '(q) having the property (b) and such that y" ~ y>(g) in — g‘(q). 
Using (D) we obtain a polyhedron P”' and a regular map g; homotopic to g such 
that gr'(p) = |v" | + = 


4. Proof of Th. I and II, (i) => (ii). We first find a regular map g homotopic 
tof such that y7 "(g) ~ Oin — g“(p). 

If n < 2m we may take g to be any regular map homotopic to f. Since 
Tg) = T(f) = O therefore y/-"(g) ~ 0 and by Lemma 1 we have also 
"(g) ~ Oin — g“(p). 

If n = 2m we chooseg so as to satisfy Lemma 3. Since c(g) = c(f) = 0 there- 
fore v[y> (9), ¥c(g)] = 0. The cycle y;(g) satisfying the condition (b) it follows 
from Lemma 2 that y7(g) ~ 0 in M’"™' — | y3(g) |. Applying Lemma 1 we 
obtain ¥7(g) ~ 0 in — | | — P™* = — g'(p). 

The existence of g being thus proved, there is by (D) a (n — m — 1)-dimen- 
sional polyhedron Q”"-"" and a regular map g: homotopic to g such that 
= 

Since T'(g:) = 0 we have y; "(g:) ~ 0 and by Lemma | also 7, "(gi:) ~ 0 
in M"™* — Q"-"". Using (D) once more we obtain a (n — m — 1)-dimen- 
sional polyhedron P”-"* and a map gs homotopic to g; such that gz'(p) = 
g2'(q) 

(ii) => (iii). Clear. 

(ili) => (i). Without restricting the generality we may admit that the map 
ggiven by (iii) is regular. We then have y> "(g) = 0 and therefore I'(g) = 0 
(and c(f) = 0 if n = 2m). 


5. Proof of Th. III. Necessity. Let I(f) = 0 and c(f) = mae. Because of 
the homogeneity of M?”*? it is enough to prove that the condition of Th. III 
Is satisfied for some elementary couple s? , 8: which we shall construct during 
the proof. 


*°'S. EILENBERG, See abstract No. 267, Bull. A. M. S. 46 (1940), p. 416. The paper in 
full was to have appeared in Fund. Math. 33 (1939). 
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Let g be the map given by Lemma 3, and let y” be a cycle in M*"" — g'(p) 
such that y” ~ 0 in and that = a. By a general theorem” 
there is a cycle yi having the property (b) and such that yi ~ y” in 
— Since vfyn(g), = a1 and vfy(g), v7 (g)] = therefore 
according to Lemma 2 we have ~ yr(g) in — |y3(g) |. Using 
Lemma 1 we therefore obtain azyi ~ ¥;(g) in — g"'(p). Applying (D) 
we obtain a (m — 1)-dimensional polyhedron Qi’ and a regular map gi homo- 
topic to g such that gi(q) = | + = Now, let bea 
simplex of | yi |, bi"? its dual cell, si the boundary of bi*' and of the cycle 
ab; *'. We may admit that at is oriented so as to have o[o7 , yi] = 1 and 
therefore , = a1. On the other hand we have c(gi) = and 
therefore = = (g1), It follows that v1] 


= a,. By Lemma 2 we then have aot ~ y>(g:) in — | | and by 
Lemma 1 also aot ~ in — | | — QP = — gi'(q). Ap- 


plying (D) we obtain a (m — 1)-dimensional polyhedron P”™ and a regular 
map gz homotopic to g; such that go'(p) = st + yp(g2) = aot. Let 
a; be a simplex of sj’ , 03’ the dual cell of az’, s?' the boundary of az’ and let 
os = Clearly , = a2. On the other hand we have c(g:) = 
and therefore = vfy>(g2), ¥a(g2)] = , (g2)]. It follows that 
viol , v7 (g2)] = a2. Since of has the property (b) and | oT | = sf , therefore 
by Lemma 2 ~ y7'(g2) in — sf and by Lemma 1 also azo?’ ~ (gz) 
in — gf — = — Applying (D) we obtain a (m — 1)- 
dimensional polyhedron Q”' and a map g; homotopic to gz such that’ 

gs'(p) = (p) = = (2) = 

gs'(q) = + Ya (gs) = a20%' . 
This finishes the proof, since sj and s3 obviously form an elementary couple. 

Sufficiency. Since of ~ O therefore y>(g) ~ 0 and Tg) = 0. Since 

vlot , o2] = 1, therefore c(g) = , = 


IV. The Condition (H) 


1. Lemma 4. Given a regular mapping f(M"**) C S8”** such that f~ ‘(p) = 
— where is a (n — m — 1)-dimensional polyhedron, a 
(n — m)-dimensional polyhedron such that 


— f"(q)] mod P”™ coef (m"*") = 0 
for 1=0,1,---,n—m-—l, 
there is a mapping g homotopic to f such that 
g'@=f'@, ve "G) 


Proor. Since f'(q) is a polyhedron and since M"*' — f-7(q) is connected, 
there is a neighborhood U of f-'(q) such that 


B‘(M"*' — mod coef = 0 for 


i 
if 
| 
| 
iy i | 
! 
t 
t 
‘ 
t 
h 
— 
Bs: 
ill T 
aig 
; 
Ne { 


ON CONTINUOUS MAPPINGS OF MANIFOLDS INTO SPHERES 671 


Let X = — U,Z = U-X, and ¢(z) = r[f(z)] forz eX — — P™™", 
We obtain this way a map g(X — P™””" — P™”""') CS". Since B'(X — Z) 
mod coef (m"*") = for i = 0, 1,--.,n — m — 1 therefore” there 
isa map — C S”™ such that = ¢(z) if x eZ. 

Let k(x) be a continuous function defined for c «M"*', —1 < k(x) S 1, and 
such that 


= —1 if andonlyif z ef '(q), 
= 1 ifandonlyif reP””, 
k(x) = Uf(z)] if reU. 
We define the map g(M"*") by taking 
g(x) = f(x) for 
g(x) = {¥(x), k(z)} for — U — 
g(x) =p for 


Conditions g = and g ‘(q) = f ‘(q) are obviously satisfied. Since 
g(t) = f(x) for xe U and g“(q) = f ‘(q) U, therefore = ve "(f) 
and there is a neighborhood U, of g in S”*’ such that g“(z) = f(a) ifxeU,. 
The existence of such an U, easily implies that g and f are homotopic. 


2. Now let us admit that M”*" satisfies (H) and that P””” is non-vacuous, 
connected, and such that 


(Hz) coef = 0 for i=1,2,---,n—m—1. 


From (H) using Lemma 1 we obtain B‘[M"*’ — f-'(q)] coef (m”™"") = 0 
for? = 1,2,--.,n—m-—1. From this and from (Hz) it follows that condi- 
tion (H;) is satisfied. 


3. Proof of Th. Ia and Ila. (i) => (ii). By Th. I and II we may admit 
that f-'(p) and are (n — m — 1)-dimensional polyhedra. Let 
consist of an arbitrary point of f‘(p). Conditions (He) are then satisfied and 
therefore condition (H1) of Lemma 4 is satisfied. There is therefore a map g 
homotopic to f such that is a point and g = f “(q). Repeating the 
same argument with f replaced by g and with q replaced by p we find a map m1 
homotopic to g and such that gi'(p), gi '(q) are single points. 

(ii) => (iii). Clear. (iii) => (i). Follows from Th. I and II. 


4. Proof of Th. Illa. Necessity. Let g be a map homotopic to f given by 
Th. III. Taking P” = sf we see that conditions (Hz) are satisfied. Condi- 
tion (Hi) is therefore satisfied too and Lemma 4 can be applied to g. We 


Fund, Math. 31 (1938), p. 184. 
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obtain therefore a map gi homotopic to g such that gi'(p) = st, gr'(q) = 
= + = ve(g) = Since sf is a sphere and 
st , therefore = aot . It follows that c(g:) = vfy>(g:), = 
vlact , a22] = aaz. On the other hand c(g:) = c(g) = cae and therefore 
a We have proved therefore that = = act . Repeating 
the same argument with g replaced by g: and p replaced by q we obtain a map 
gz homotopic to g; and such that 


=F, Ye.) = aw? 
Sufficiency follows from Th. ITI. 

V. Mappings of into 


1. Lema 5. Given a regular mapping f(M"*") S8”** and a decomposition 
mM" = + Xe into two closed sets such that @, 


| S”*! — p, there is a mapping g homotopic to f such that I 
g(X1) C g(X2) C E™". 


Proor. Let 7 < 1 be a number such that zx e f(X;) implies l(z) > —7y and 
x ef(X2) implies l(z) < 7. 
Let k(x) be a continuous mapping of the interval —1, 1 into itself such that 


k@=2 if c23(n+ 1), 
= 0 if 
Sz forall 


We define a map s(S”"') C S”” by taking s(p) = p, s(q) = q, if x = {r(z), U(a)} 
then s(x) = {r(x), k[l(x)]}. The map g(x) = s[f(x)] satisfies the conditions of 
Lemma 5. 


2. Proof of Th. Ib and IIb. By Th. Ia and Ila it is enough to prove that 
given a map f(S"™’) C such that = p, = there is a map 
g(S") C S” such that F* is homotopic to f. 

Since f(E2™) C — q and — p there is by Lemm: 5 
a map g homotopic to f and such that g(E2"’) C C It 
follows that g(S") C S”. Let o(S”) C S” be the map of S” thus defined. It 
is easy to see that for each x « S"** we have | g(x) — F*(x) | < 2 and therefore 
that g and F* are homotopic. 
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3. Proof of Th. IIIb. Since 0 X S” and S” X 0 form an elementary couple 
in EB" x S™ + S" X E™" therefore by Theorem IIIa it is enough to prove 
that each map X S" + S™ C S™** such that 


f'(p) =0x 8", yo(f) = a(0 X o”), 
= 8S" x0, v7 (f) = X 0). 


(where o” is a basis cycle for S”) is homotopic to a map H* where ¢(S" x S”™) 
Cc S"is a map of the type (a1, ae). 

Since f(E”™ x S") C — q and f(S” X — p therefore by 
Lemma 5 there is a map g homotopic to f such that 


g'(p) = 0 8”, = X o”), 
g (9) = 8" x 0, Ya (g) = X 0), 
x S”) c Ep", g(S” 4 E”™*) 


It follows that g(S” X S”) C S”. Let o(S” X S”) C S”™ be the map 
of S” xX S” thus defined. We then have | g(x) — H*%(x)| < 2 for each 
X S" + 8S" X and therefore g and are homotopic. 

For each te S” the element E”*’ X ¢ has an intersection coefficient 1 with 
0 X o” and therefore an intersection coefficient a; with y>(g) = ai(0 X o”). 
It follows by (1) that g[g, Z2”** X t, p] = a and therefore gly, S” X t] = a. 
Similarly we obtain gly, t X S”] = a2. Therefore ¢ is of the type (a: , a2). 


UNIVERSITY OF MICHIGAN. 
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obtain therefore a map gi homotopic to g such that 9: '(p) = st, 9 (q) = 


= + Q™", = = Since sf is a sphere and 
, therefore y5(g1) = aot . It follows that c(g:) = v[y>(g1), v7 = 
vlact , a2] = aaz. On the other hand c(g:) = c(g) = ayae and therefore 
a =a. We have proved therefore that = = at . Repeating 
the same argument with g replaced by g: and p replaced by q we obtain a map 
gz homotopic to gi and such that 


gz = si, —¥p(g2) = 
g2'(q) = 8, = aro? . 
Sufficiency follows from Th. III. 


V. Mappings of into S”* 


1. Lemma 5. Given a regular mapping f(M"*!) C S”*! and a decomposition 
= X, + Xz into two closed sets such that f(X:) C — gq, f(X:) 
S”™*" — p, there is a mapping g homotopic to f such that 


ve 9) => 
g'@=f'@, va "g)=ve "(f), 
g(X1) C g(X2) C EZ". 


Proor. Let 7 < 1 be a number such that 2 ¢ f(X;:) implies I(x) > —7 and 
x implies < 7. 
Let k(x) be a continuous mapping of the interval —1, 1 into itself such that 


if x= 3(n+ 1), 
k(jz) = 0 if xy, 
k(jz) Sx forall 


We define a map s(S”"’) C S"" by taking s(p) = p, s(q) = q, if = {r(@), U(2)| 
then s(x) = {r(x), k[l(x)]}. The map g(x) = s[f(x)] satisfies the conditions of 
Lemma. 5. 


2. Proof of Th. Ib and IIb. By Th. Ia and Ila it is enough to prove that 
given a map f(S""’) C S”** such that f'(p) = p, f(g) = q there is a map 
g(S") C S” such that F* is homotopic to f. 

Since f(E2™) C — q and C — p there is by Lemm: 5 
a map g homotopic to f and such that g(E2"’) C C It 
follows that g(S") CS”. Let g(S”) C S” be the map of S” thus defined. It 
is easy to see that for each x « S"** we have | g(x) — F*(x) | < 2 and therefore 
that g and F® are homotopic. 
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3. Proof of Th. IIIb. Since 0 X S” and S” X 0 form an elementary couple 
in BE"? X S" + S" X E™" therefore by Theorem IIIa it is enough to prove 
that each map X S™ + S™ E™**) C such that 


f'(p) = 0x 8", ¥3(f) = X o”), 
= 8" x0, = X 0). 


(where o” is a basis cycle for S”) is homotopic to a map H® where ¢(S”" < S™) 
c S" is a map of the type (a1, ae). 

Since xX S") — q and f(S” X — p therefore by 
Lemma 5 there is a map g homotopic to f such that 


g'(p) = 0X 8”, ¥>(9) = a(0 X o”), 
g = S" X 0, = X 0), 
g(E”"™ x S”) Cc Ee", g(S” x 


It follows that g(S” X S”) C S”. Let o(S” X S”") C S”™ be the map 
of S” xX S” thus defined. We then have | g(x) — H*(x)| < 2 for each 
xX S" +S” X and therefore g and H® are homotopic. 

For each t eS” the element E”*’ X ¢ has an intersection coefficient 1 with 
0 X o” and therefore an intersection coefficient a; with y>(g) = a(0 X o”). 
It follows by (1) that glg, Z”*’ X t, p] = a and therefore gly, S" X t] = a. 
Similarly we obtain gly, t X S”] = ae. Therefore ¢ is of the type (a: , ae). 


UNIVERSITY OF MICHIGAN. 
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ON p-ADIC FIELDS AND RATIONAL DIVISION ALGEBRAS’ 
By A. A. ALBERT 
(Received January 20, 1940) 


1. Introduction 


The known results” on the existence of abelian extensions, with prescribed 
properties, of an algebraic number field have been obtained as a part of the 
class field theory. This latter theory is certainly of great importance but is 
undeniably of tremendous complexity. It would thus be very desirable, if 
possible, to obtain a simpler and more algebraic derivation of the theorems on 
abelian extensions mentioned above. 

The corresponding theorems on abelian extensions of p-adic number fields 
have been obtained® as analogues, in the local class field theory, of those on 
extensions of algebraic number fields. This treatment is again undesirable, and 
moreover, is at least partly unnecessary. The class field theoretic treatment 
also hides the really simple results.which it is possible to obtain on the structure 
of arbitrary fields whose ramification order is prime to p, as well as the fact that 
abelian extensions of a prescribed degree over a p-adic field § may not even exist. 

We shall use a very simple lemma‘ of what may be called an arithmetical 
nature on p-adic fields and shall give a consequent new algebraic discussion of 
fields & of finite degree and ramification order e prime to p over a p-adic field §. 
We shall construct all such fields, show that the number of such fields inequiva- 
lent over § is an explicitly determined divisor of e, and shall determine necessary 
and sufficient conditi®ns that & be normal over §. We shall determine the 
automorphism group of & over §, and prove the rather amazing result that the 
existence of any abelian field of the given ramification order e over § implies 
that all fields with the same ¢ are abelian over §. Finally we shall determine 
explicitly which abelian fields are cyclic, how many cyclic fields exist which are 
inequivalent over §, the direct factorization into two cyclic fields of all non- 
eyclic abelian fields. 

An extension of our treatment to abelian fields of ramification order not prime 
to p over a p-adic field might be obtainable but would probably be so complicated 


1 Presented to the Society December 28, 1939. 

2 See [7] of the bibliography at the end of this article. 

* In [8], [28]. See also the theory of class fields as given in [10], [6], [9], and of local 
class fields in [6), [21], [24], [25]. 

‘A more general form of this lemma is due to G. E. Wahlin and was first proved by 
him in [29]. We use this more general form later in considering fields of arbitrary ramifica- 
tion order, but believe it desirable to make the theory of p-adic fields of ramification order 
prime to p depend upon as little p-adic theory as possible. Hence we indicate the very 
simple proof of this more elementary lemma. 
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that it might be just another form of that given by the local class field theory. 
However some results are obtainable simply. We shall consider fields & of 
residue-class degree f and ramification order ¢ over a p-adic field § and write 
¢ = p'@ With é prime to p. We shall then show that & is completely ramified 
of degree p’ over a subfield of ramification order e and residue-class degree f 
over ¥. We shall use this result to give new simple proofs of the theorems’ 
stating that all normal extensions of a p-adic field § are solvable, and that there 
are only a finite number of extensions of a given degree inequivalent over §. 

It is known’ that if D is a generalized quaternion division algebra over a 
quadratic extension & of the rational number field ® then D has a maximal 
subfield R(x), 2” = ain ®. . To obtain a generalization of this result we may 
consider a normal division algebra D of degree m over its centrum and let R 
have degree r over an algebraic number field §. Then we ask whether or not 
there exists a maximal subfield R(x) of D such that the minimum function of x 
has coefficients in § and the field §(x) is cyclic over §. This question is equiva- 
lent to that about the existence of a cyclic field 3 of degree m over § such that 
the composite of 3 and & splits D. We shall show that 3 may, in general, 
not exist. We shall, indeed, find necessary and sufficient conditions that 3 
exist. In particular 3 always will exist if m is prime to r. 


2. Notations and elementary results’ 


Let p be a rational prime integer which will be fixed until we say otherwise. 
We speak of the derived field ®, of the field 9 of all rational numbers as the 
field of rational p-adic numbers. If § is any algebraic extension of ®, we call 
the quantities of § p-adic algebraic numbers. 

We shall consider a field § of finite degree over R,. The set J; of all integers 
of § has a unique prime ideal ps and we let gz be the number of elements in the 
residue class field Hy = Jx.— px. This field H; is then a finite (Galois) field 
GF (qx), where gg is a power of p. 

We shall make our discussion relative to a fixed prime integer P; of §. Then 
bs = (Ps), every prime integer of § has the form uP; where wu is a unit of Js. 
We shall speak of the units of the integral domain J; simply as units of §, the 
prime integers of § as primes or prime quantities of §. 

Every p-adic field § contains a primitive (q; — 1) root of unity which we 


* The first of these theorems is due to Hensel and is stated in an equivalent form and 
proved in [16] by the use of the method developed by Hilbert in his theory of ramification 
fields. The second theorem is stated by Krasner in [18] with methods of computing the 
number of fields in [19], [20]. He indicates without details that the theorem itself follows 
from the results of [17]. 

; a is a result in [1]. It is generalized to algebras over an algebraic number field 
in [2], 

, 7 We use the exposition of p-adic theory of [8], [4] as a basis for our present theory. 
Note that the very interesting results of Hensel in [11]-[16] could be simplified greatly if 
based upon the more modern exposition of these two texts, due essentially to Hensel, H. 
Hasse, and O. Ostrowski. 
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shall fix throughout our discussion and designate by ¢;. Every quantity a of 5 
is expressible in the form 


(1) a = 
where A and uz are rational integers and u is a unit of § such that 
(2) u = 1 (mod pz). 


The expression of a in the form (1) is unique, that is, if also a = P§ t§u for u = 1 
(mod p;) then \ = a, nh» = B, u = uw. The integer d is called the order of a. 
The integers of § are those quantities (1) with \ 2 0, and the units of § are 
the quantities with X = 0. We call units of § satisfying (2) principal units 
of §. The set of all such units forms a subgroup of the multiplicative group 
of all units of §. 
We now let & be a field of degree n over a p-adic field §. The meaning of 


(3) = Je — de 
of & is clear, and we have 
(4) n= Pe = , qe 


where e¢ is the ramification order of 8 over §, f is the residue-class degree of & 
over §. The field & contains a primitive (qe — 1)" root of unity {», and we 
shall write ; 


+6, 

and shall assume throughout our discussion that 

(6) = 

If now Pg is any prime quantity of & the expression of any A of & in the form 
(7) A = Pagel 


for rational integral \ and yu, and U a principal unit of &, is unique. 

We note that every principal unit U ¥ 1 of & has the form U = 1 + P#Us 
for Up a unit of R, Ay > 0. We then prove 

Lemma 1. Let m be a rational integer prime to p. Then 1 is the only principal 
unit of R which is an m™ root of unity. 

For if U ¥ 1 then U = 1 + P¥Uo, mUy is a unit U; of &, U" 
1 + + where B is an integer of Then — 1 

+ 

As a simple consequence of the lemma just proved we have 

Lemma 2. Let m be prime to p and A in & be such that A” = Pete. Thenm 
divides } = \yom and up = wom, and A = 

For A = Py¢%U for a principal unit U of 8. Then A” = Pe °¢e?U" = 
Pere. Since isa principal unit of R we have my = A, = =1. 
By Lemma 1! we have U = 1. 
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The case X = 0 of the result above gives 

Lemma 3. Let m be prime to p and U be in & and such that U™ is a power 
of fe. Then U ts a power of Se. 

We also have 

Lemma 4. Let m be prime to p. Then K contains a primitive m root of 
unity ¢ if and only if m divides r = qe — 1, £ is a power of fe. 

For if ¢ = ¢g and 7 = mr the quantity ¢ is a primitive m“ root of unity in &. 
Conversely if ¢ in & is a primitive m* root of unity we have ¢”" = 1 = ¢% and 
Lemma 3 implies that ¢ = ¢g. The cyclic multiplicative group generated by 
tg has order 7 and has the cyclic subgroup of order m generated by ¢, m divides r. 

Our final preliminary lemma will be found to be fundamental in our later 
discussion.” 

Lemma 5. If m is prime to p and U is a principal unit of & there exists a 
principal unit V in & such that V™ = U. 

For let f(z) = 2” — U. Since U = 1 (mod pe) we have f(z) = x" —1= 
(x — 1)go(x) (mod pg). Here go(x) is a polynomial of degree m — 1 of He{z] 
and is prime toz — 1. For 2” — 1 is separable when m is prime to the charac- 
teristic p of Hg. By Hensel’s Lemma there exists a quantity V in J¢ such 
that f(x) = (x — V)g(x), V = 1 (mod pg). But then f(V) = 0, V is a principal 
unit of R such that V™ = U. 

The properties derived above are not properties of & relative to § but are 
what we may call absolute properties of R. They then hold for any subfield § 
over R, of R. However, we shall also require some elementary properties of & 
relative to 

The field W = §(£e) is unramified of degree f over §, that is, Py is a prime 
quantity of W. Also fe = fm. If Wo is any unramified subfield over § of K 
its residue class degree over § is a divisor fo of f, the degree fo of Wo over F 
divides the degree of Y&, Wo is equivalent over § to a subfield W, of W. But 
YW, is normal over §, YW; contains all subfields of & which are equivalent over § 
to itself, Wo = W, < W. We have shown that W is the maximal unramified 
subfield over § of & in the sense that zt contains all subfields of R unramified 
over 

A well known theorem’ states that & = YW(P) for any prime quantity P of & 
and that & is completely ramified of degree e over §. Now (P)° = ps = (P3), 
P= UP; fora unit U of 8. Since R = W(P) we have U = a +a,P + --- + 
for a;in Then P is a root of 


(8) f(x) = 2° — — --- — — = 0. 


We conclude that f(z) must be the minimum function over % of P and must 
be irreducible in YW. But P is an integer of & and the coefficients of f(x) are 
Integers b; = Psa; of W. The residue-class degree of & over W is unity, 
New(P) = --aPs has order unity, a is a unit of QW. The ideal (P)’ = pw 


‘This is the lemma referred to in our introduction. 
*See Chapter IX of [4]. 
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divides P’ — Psa) = + P*(bo + --- + beaP*”), divides bP, (P) 
divides b; and b; cannot be a unit of YW. Hence b; = P xa; for a; an integer of R. 
Continuing in this fashion we ultimately prove that all the a; are integers of Q. 
This is the first part of” 

TurorEM 1. Let W be the maximal unramified subfield over § of & over §, 
P be any prime quantity of &, e be the ramification order over § of R, Px be a prime 
quantity of §. Then the minimum function over BW of P is 


(9) f(x) = + Peco + + + 


for integers c; of BW, co a unit of W. Conversely tf co is a unit and the remaining 
c; are integers of a field W unramified over § any stem field R = W/E) of (9) has 
ramification order e over § and YW as maximal unramified subfield, & is a prime 
quantity of R, U = + +--+ + is a unit of 

For the proof of the second part of our theorem we note that any root & of 
f(x) = Ois an integer of R = Then so is U, & = must be divisible 
by pe. But cis a unit of W and of K and is not divisible by peg , U = m + £4 
is not divisible by pe and is a unit of ®, the ideal (¢)° = (—UP s) = pw. Hence 
the ramification order of & over W is at least e, the degree of R over BW is at 
most e, R has degree and ramification order e over Y%. Evidently then f(z) 
is irreducible in YW, (€) = pe , £is a prime quantity of §. Since & is completely 
ramified over 8 the field YW is the maximal unramified subfield over § of &. 


3. Fields with e prime to p 


The structure of algebraic extensions of ramification order e prime to p over 
‘a p-adic field § is remarkably simple. Our results are fundamentally due to 
two simple theorems, the first of which is 

THEorREM 2. Let § be a p-adic field, e and f be rational integers, 


(10) tT=@-1, =1, BW= 


for a primitive +” root of unity fm. Then every field R of degree ef and ramifica- 
tion order e over § is equivalent to 


(11) R = BWP), P* = tsP;, 


for a rational integer i which we shall call a type number of &. Conversely the 
fields (11) have ramification order e and degree ef over §. 

For by Theorem 1 we may write ® = W(P), P’ = P;tigU for U a principal 
unit of . By Lemma 5 there exists a principal unit V in & such that V° = U. 
Write P) = V'P so that Py is a prime quantity of &, ® = W(Po), Po = 
U"'P® = Pst as desired. The converse is an instance of Theorem 1. Note 
our use of (a, b) for the greatest common divisor of two rational integers a and b. 

We next have 

TuEorEM 3. Let & and & be two fields of the same degree and ramification order 


10 This is a type of Eisenstein theorem. See in this connection p. 74 of [11). 
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e over § (e, p) = 1, and let i and j be a pair of respective type numbers. Then 
Q and 2 are equivalent over § if and only if 1 and j are congruent modulo 


(12) d = (7, e). 
Thus the fields 
(13) R: = Pi=sePs 


give a set of d inequivalent fields of the same degree ef and ramification order e 
prime to p over & to one of which every such field" is equivalent. 

For if & = W(P), = we writed = dk +i (0 Si <d),d = ee +77, 
for rational integers e: and 71, = (¢#)’, = The quantity 
P; = «x P is a prime quantity of 8, R = WP.) as in (13). Conversely 
every field (11) has degree ef and ramification order e over §, this is true in 
particular of the fields (13), the fields (11) are equivalent to fields (13). If & 
has a type number 7 and & has a type number 7 then & and & are equivalent 
over § if and only if & contains a prime quantity Po such that Po) = Petr . 
Now Py) = UP for a unit U of &, = = = Then 
U' = ts. By Lemma 3 U = ¢e, (i = te" which is true if and only if 


(14) j — t = ke (mod 7). 


But this congruence has a solution k if and only if d divides 7 — 1. 

We note the corollary of our proof. 
Png 6. Let S be any automorphism over F of a field K of (11). Then 

= 

For ¢% is a primitive 7 root of unity and is a power of fe, (P*)’ = {@P3 = 
tsP;. Our result follows from the proof above. 

We may however obtain the more explicit result of 
Pn 4. The automorphisms S over F of a field K of (11) carry § into 
where 


(15) = 

for a rational integer p. If then S is any automorphism over § of R we have 

for some integer u and an integer k such that 

(17) ke = i(q§ — 1) (mod 7). 


Conversely the correspondence in & over § induced by (16) is an automorphism S 


7m It is natural at this point to discuss also the subfields of a given R, and the number 
of inequivalent subfields of a given ramification order and residue-class degree. The 
author has studied and solved this problem but has not included the results in the present 
paper because to do so would delay unnecessarily our real goal, the study of abelian fields. 


methods of obtaining these additional results is that of the proof of the present 
eorem, 
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over F of & for every solution k of (17), S exists of and only if d = (e, r) divides 
1). 

For Lemma 6 states that (P*)’ = ¢sP° = ¢is*'Ps while clearly we must 
have (P*)’ = P;(¢%)° for an automorphism S over § of &. It is known that 
cy = Ci for some s, and ¢%*' = fw", (17) holds. Conversely (17) holds for 
some k if and only if d divides 7(gg — 1). It is simple then to verify that (16) 
is an automorphism of ® = Q(P) over §, P’ = P3fw, if and only if P’ = 
(P;tis)* for S inducing an automorphism over § of BW. 


4. Normal fields 


The most interesting algebraic extensions of a field § are the normal fields 
over §. For such fields we have the following existence theorem. 

THEorEeM 5. There exists a normal field R of ramification order e prime to p 
over a p-adic field § if and only if e divides gg — 1, that is, the maximal unramified 
subfield of K contains a primitive e* root of unity. 

Our theorem thus states that there exists a normal field & of ramification 
order e prime to p and degree ef over 8 if and only if there are e fields in the 
set (13) to which all fields with the given e and f are equivalent. We derive it 
as the consequence” of the case i = 0 of 

THEOREM 6. Let & have degree ef and ramification order e prime to p over F, 
K = Q(P), P’ = Pst. Then & is normal over F if and only if e divides both 
gs — 1 and i(gg — 1). 

For W is cyclic over § and its automorphisms have been seen to carry the 
minimum function g(x) = 2° — of P over to g,(x) = 2° — 
Evidently & is normal over § if and only if it contains every root of all the 
polynomials g, (x) in the root field 3 containing & of the product go(x) -- - g,(z). 
In particular ¢P is a root of g(x) in 3, where ¢ is a primitive e“ root of unity. 
Hence if & is normal over § the quantity (¢P)P"' = ¢ must be in &. By 
Lemma 4 this is possible if and only if e divides gg — 1. If then ¢ is in & the 
field will contain all roots of all the g,(x) if and only if & contains a root of 
g.(x) for all wu. Theorem 4 states that this is satisfied if and only if e divides 
i(qs — 1) for all » = 0,1,---,e — 1. But if e divides 7(qg; — 1) it clearly 
divides — 1) = — 1) (1 + + --- + Gf’) for all values of u. 

As an almost immediate corollary we have 

THEOREM 7. A completely ramified field of degree e prime to p over a p-adic 
field § is normal over § if and only if it, as well as all other completely ramified 
fields of the same degree over %, are cyclic over §. Moreover this occurs when and 
only when § contains a primitive e root of unity, there are e inequivalent com- 
pletely ramified fields of degree e over §." 

For if & is normal we apply Theorem 5 with § = BW, f = 1 to see that ¢ 
divides gj — 1, § contains a primitive e root of unity. Every completely 


2 That e necessarily divides gg — 1 follows explicitly from Theorem 6. The existence 
of a normal field is given by the case i = 0 of that theorem. 
18 See also [26], [27] for results of this type. 
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ramified field of degree e over hg is then equivalent to one of the fields R; = §(P), 
= (i = 0, 1,--- ,e — 1), the correspondence P P* = ¢P clearly 
induces an cctaieaenihile of order e over § of R;. Hence every &; of (13) is 
cyclic over 

The result above has the heiioniein rather interesting consequence. 

TurorEM 8. Let e be prime to p, © be the Euler" &-function of ¥ = e(qs — 1), 
{> be a primitive (qs — 1)™ root of unity, Wo = F(t). Then every field R of 
ramification order e and degree ef over § is adding over & to a subfield of the 


normal field 
(18) N = WP) = Wo XK F(P) = (M, Wo), P' = Py, 


of ramification order e and degree e® over §. 

For if 2% = Wo(P) is defined by P* = Py the field N is normal over § by 
Theorem 5 if and only if e divides gj — 1. But & is defined so that W divides 
q; — 1. Hence % is normal over §. Evidently ® is the composite of the 
completely ramified field §(P) and the unramified field Wy, N = Wo X F(P). 
By Lemma 4 Y&® contains a primitive VW‘ root of unity ¢, ¢° is a primitive 
(g — 1) root of unity fg and we may assume that the maximal unramified 
subfield of R is W = F(kwe) S Wo. Nowif P; = Pe’ we have P; in N, Pf = 
P< = Pstis , N contains every RK; in the sense of equivalence. Clearly N 
is the composite of YW and KR. This proves our theorem. 

We shall proceed to determine the automorphism group © of any normal 
field with (e, p) = 1. By Theorem 7 R = W(P) is cyclic over W, the group 
of R over Wis H = [T]. We are taking R = WP), P’ = Psfw and may 
thus assume that 7’ is induced in by 


(19) = Se, =P” = SP, 
where 
(20) g@—-l=e, 


The group § is a normal divisor of index f of G and the quotient group is the 
cyclic automorphism group 


(21) G/H = [SH], 


where S is an automorphism over § of &. Here we may choose S to be any 
automorphism over § of & such that [S$] induces the group of W over §, 
that is any S such that 


(22) 
is a generating automorphism of % over §. Thus we take 
(23) = (Sa)%. 


“ This ® is the number of integers not greater than and prime to¥. We do not use the 
customary symbol ¢ of the Theory of Numbers, as we shall use the notation ¢ for valua- 
tions in a later section. 
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The choice (23) now implies that , 
(24) 89, 


since the cosets (SS)? = S’S are distinct for j = 0, 1,---,f — 1, (SS) 
leaves ¢ unaltered and isin §. Thus there exists a rational integer 6 such that 


(25) =T". 
Also § is a normal divisor of @ and 
(26) TS = ST" 


for a rational integer y. We determine & completely in the following 

THEorEM 9. Let & be normal of ramification order e prime to p and degree ef 
over § so that R = WP), P’ = Pxfwe~. Then the group of & over § consists of 
the ef distinct automorphisms 


(27) ST? (k =0,1,---,f—1,j =0,1,--:,e-1), 
where 

(28) T=I1, TS = ST%, 

and T is given by (19), S is given by (23) and 

(29) —1) =eh, P® = 


We have already a partial proof of this result and to complete our proof we 
need only verify that y = qs, 6 = 7 satisfy (26), (25) if (29) holds and that 
= = Now P™ = = Also 
= ¢*P = = = Hence TS = ST as 
desired. We compute P™ = and see that = 
Hence P” = = P™, Sf = T". 

In the discussion we shall now make of abelian fields we shall require the 
following 

Lemma 7. Lett = (e,%). Then the order of the automorphism S of the group 
in Theorem 7 is eft’. 

For if S has order k then (§S)" = 9, k is divisible by the order f of the 
coset HS, k = fl. Then S“’ = 7" = J. The order of 7" is evidently et 
et’ divides 1. But = = I, k = fl divides eft’, 1 divides et 

k = eft” as desired. 


5. Abelian fields with ¢ prime to p 


Equation (28) implies that S7 = 7'S if and only if gg = 1 (mode). But & 
is abelian if and only if ST = 7S and we have” 


6 The author originally obtained this result for the most interesting case of cyclic 
fields as follows. Let & be cyclic of ramification order e prime to p and degree ef over 4 
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TurorEM 10. There exists an abelian field of ramification order ec prime to p 
over & if and only if all fields of this ramification order are abelian, § contains a 
primitive e root of unity.” 

Thus there are either no abelian fields of ramification order e over § or all 
fields of ramification order e are abelian. Moreover in the latter case for every 
degree n = ef there is a set of e inequivalent abelian fields of ramification order e 
and degree n over § to one of which every abelian field with the given n and e 
is equivalent. We also have the elementary consequence. 

THEOREM 11. Let § be a p-adic field and n be an integer prime to p(qx — 1). 
Then the only abelian field of degree n over § is the cyclic unramified field. 

We continue our study of the structure of abelian fields by proving 

THEOREM 12. Let R be an abelian field of degree n = ef, ramification order e 
and a type number i over §, e be prime to p and define 


(30) 5 = (e, f). 


Then K is the direct product of its maximal unramified subfield W and a cyclic 
completely ramified field of degree e over & if and only if 6 divides i. 
For the congruence 


(31) if = 7 (mod e) 


has a solution ¢ if and only if 6 divides z. Let 6 divide 7, t be a solution of (31) 
Since & is abelian we have gz = 1 (mod e), 


(32) o=lt+at--- =f (mode), 
so that 
(38) tc = i (mod e). 


But then to is a type number of &, R = WP), P’ = Py(ew)! = Page in §F, 
8 = WX ¥(P). Conversely if R = WX F(P) as above we have K = WiP), 
ot is a type number of &, ot = 7 (mod e), 5 must divide 7. 

The type of abelian field considered above is one extreme, the other of which 
is considered in 

THEOREM 13. Let & be as in the hypothesis of Theorem 12. Then & is cyclic 
over § if and only if i is prime to 6. 


For let i be prime to 6, n = pi' --- p?' for positive rational integers g;, distinct 
primes p;, and n = ef the degree of R over §. Then 


p-adic field §, so that = W(P) as in Theorem 3. The automorphism group of R over W 
8 the cyclic group [T'] where T = S/, S is a generating automorphism of & over §, and 
Clearly P? = wP, w is a primitive e‘ root of unity. By Lemma 6 PS = oP for fin B, 
PS’ = PT = Nix (to) P, so that w = is in as desired. 

See also [22] for results of this type. 
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where the a; and @; are rational integers such that 
(35) 290, B20 (j= 1,---,2). 


If a8; > 0 then p; divides 6 and is prime to 7, (e, @) is prime to 6;, e[(e, i)] 
is divisible by By Lemma 7 the order of is divisible by = [S] 
contains an element S; of order p/’. If a; = 0 the integer e is prime to p, the 
order of S is again divisible by p?/ = p4’, S; of order p%/ again exists in [S]. 
Finally if 6; = 0 the group [7] contains an element S; of order pj’ = p7?. The 
order of the product Sy = S; --- S; is n, G@ = [So] is cyclic. Conversely let G 
be cyclic of order n and let p be a prime divisor of all the integers e, f, and i. 
If G = [So] then n = p’no for (nm , p) = 1, S: = So° has order p’ = p*** where 
e = f = p'fo, p) = 1. The order of Sis = pv, (%, p) = 1, the 
order of S”° is p* since m% is divisible by » and is prime to p. But p divides 
(c,i),\ <a+ Thus if = S*7” we have S; = where 7’ has 
order p*, St = 1 if m = p****, a contradiction. 

As final special case we derive 

TuroreM 14. Let & be as in the hypotheses of Theorem 12, and let i = Aw 
where ty is prime to ef and A divides 6 = (e,f). Then R = 81 XK Re where R2 is a 
completely ramified cyclic field of degree A over §, &; is cyclic of degree ef A’ and 
ramification order e over §. 

For e = Aéy,f = Afo. We use the group of & over § as determined in 
Theorem 9 and have S’ = 7) = generates the group [7'], generates 
a subgroup of order é of [7]. By Lemma 7 the order of S is feo = ef. We 
put 7, = ToS %° and have 7} = T'S’ =I. The subgroup of generated 
by S and 7; contains 7S’° = Ty and hence 7, it is the group ©. Its order is 
at most the product of the order Efy of S by the order of 7;. The order of 
T; divides A, the order of © is efyA, T; has order A, © = [S] X [71]. The sub- 
field ®2 of R unaltered by S has intersection § with W, 2 is a completely rami- 
fied field of degree A over §. The intersection of the subfield &; of & unaltered 
by 7; with QW is the set Wp of all quantities of KR unaltered by both 7; and 7. 
Hence YW is the subfield of YW unaltered by 7 )71' = s/ ° Wo is a field of degree 
fo over §. Now &: is cyclic of degree foe and residue-class degree fy over 3, 
the ramification order of R, over § is e. 

We shall combine the results above by the use of 

THroreM 15. Let & be an abelian field of degree n = ef, ramification order e 
prime to p and a type number i over §. Suppose that n = ab for relatively prime 
a and b so that 


e=¢e, f=fife, a=afi, b= 
for rational integers e; , e2, fi, fe. Then & is the direct product 


R= LK M 
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where © and M are abelian of respective degrees a and b and ramification orders 
e, and e2 over §. Moreover if we determine rational integers a and B so that 

1(mod a), Bhi 1(mod b), 
then % has a type number at and It has a type number Bi over §F. 


For & = W(P), P’ = Ps, and e divides gg — 1. Then e divides gf' — 1 
and we put 


so that W, = §(t) has degree f; over F. Clearly 
(37) o = 1+ + --- + = fo(mod e). 


Hence afp = ac, = 1(mod e), ac; = 1(mod e;), and there exists a rational 
integer such that 


(38) acy = 1 — ye). 

We put 

(39) P, = 

and have 

(40) PP = Py = 


The field £ = YW,(P:) has degree a = e,f;, ramification order e;, and a type 
number at over §, e divides gg — 1, is abelian. Similarly we obtain an abelian 
subfield IN of degree b = eef2 , ramification order eg and a type number fi over 
§. The composite in & of its subfields & and M is their direct product of degree 
ab = nover R = M. 

Our final theorem on the structure of p-adic abelian fields is now stated as 

THrorEM 16. Every non-cyclic abelian field of ramification order prime to p 
over a p-adic field § is the direct product of two cyclic fields one of which may be 
laken to be completely ramified over §. 

For if 8 is non-cyclic the greatest common divisor A of e, f, and @ is not unity. 
Then we may write 


(41) e=et, f=hifi, 


where ¢ is the product of all prime factors of e not dividing A, fo is the similar 
product for f. Then (éofo, e:f1) = 1, every prime factor of ef; divides A, that 
is, divides e, ,f,,and7. By Theorem 12% = & X M where Gis abelian of degree 
efi, ramification order e; and a type number ai over §, afo = 1(mod eif;), @ is 
prime to e,f;. Hence the greatest common divisor of e , fi, and atis A. Simi- 
larly IN is abelian of degree eof and ramification order é over §, 8 is prime to 
“fo, 2 type number of M is Bi prime to eofo , Mis cyclic by Theorem 13. There 
remains the discussion of the structure of &. 
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If p is any prime factor of ef; we have determined e; and f; so that p divides 
and 7. We put the prime factors of into two classes. Let the first 
class consist of those primes p for which the highest power of p dividing 6 = (e, f) 
divides 7, and the second class all remaining primes. Then if p is in the second 
class and ¢; = p“e2, fi = p’fo,1 = pie , we have chosen p so that the minimum 
of \; and de is greater than 43. We may then write 


(42) a= ets, fi=fsfs, 
where 
(43) (esfs eafs) = 1, 


ésfz is the product of primes of the first class, e,f, of primes of the second class. 
Clearly (e3 , fs) divides 7, 8i = At where % is prime to ef; and A divides (e , fy). 
Then by Theorem 14 


(44) L= XK 


where &; has degree ef; , ramification order e; , and a type number a;i over §. 
By Theorem 12 %; = YW; X KR; , Ws is unramified of degree fz over §, Rs is com- 
pletely ramified of degree e; over §. By Theorem 13, % = & X 4 over §, Rs 
is completely ramified of degree A over §, %; has ramification order e, and residue- 
class degree fA’ over §. Thus 


(45) RK = (M Ws K & (Vs K Rs) 


where I X Ws X & is cyclic over F, % X Ky is a cyclic completely ramified 
field. This proves our theorem. 

In closing we note 

THEOREM 17. Let e and f be rational integers, (e, p) = 1, and write 5 = (e, f), 
e = be. Then every cyclic field of degree ef and ramification order e over a p-adic 
field § is equivalent to one of a set of e&(6) such fields inequivalent over §, where 
is the Euler -function. 

For we have seen that every abelian field is equivalent to one and only one 
of the fields with type numbers 7 = 0, 1, --- , e — 1 and these fields are cyclic 
if and only if 7 is prime to 6. There are (5) numbers not greater than 6 and 
prime to 6 and if a is one of these then a, a + 6, a + 26,---,a+ (e — l)éare 
complete set of integers in the set 0, 1, --- , e — 1 which are congruent to 4a 
modulo 6. Evidently every desired integer i is congruent to an a modulo 6, 
there are e&(5) such integers. 


6. Structure of arbitrary fields 


Our first results on the structure of fields whose ramification order is arbitrary 
is the rather surprising 

TueoreM 18. Let & be a field of degree ef and ramification order e over a p-adic 
field & and let 


(46) e=p'e,  (@,p) =1. 
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Then & is completely ramified of degree p’ over a subfield & of degree eof and rami- 
fication order over 
For, by Theorem 1, 8 = W(P), P’ = UP;, for a unit U of R. We write 
) = (Uo where Up is a principal unit of 8. By Lemma 5 U,) = V for a prin- 
cipal unit V of &. Pub & = W(R), R = VP? and have R® = V°P* = 
Up teUoP = , has degree eof and ramification order over §, = (P), 
p” = VR, & is completely ramified of degree p’ over &. This completes our 
roof. 
; If W is the maximal unramified subfield over § of a field R over § and Wo 
is any unramified extension of Y& the composites 


(47) Ro = (RK, Wo), Yo = (¥, Wo) 
over § are clearly the direct products 
(48) Ro = KR XK Wo, % XK Wo 


over YW. Hence the ramification order of &o over § coincides with that of R 
over §, %o is the field of Theorem 18 defined for &). We may then prove 

THeorEM 19. Every field R of finite degree over a p-adic field is contained in a 
field Xo of the same ramification order as R over § and such that the field %& defined 
in Theorem 18 for &o is normal over §. Moreover if R is normal over § the field 
Ro may be taken to be normal over &. 

For we choose {Wp as in Theorem 8 and have Ly = & XK Wo normal over ¥F. 
The composite of a normal field R over § and a cyclic field Wp over F is normal 
over 

We use the results obtained thus far to obtain a new proof of Hensel’s” 

THEOREM 20. The automorphism group of a normal field & over a p-adic field 
3 is a solvable group. 

For we use the field &o of Theorem 19 to see that & is contained in 
Xo = % S Wo = F, where Kp is normal of degree p” over % , Yo = Wo(P) such 
that P*® = Pg, in Wo , & is normal over Wo , Wo is cyclic over §. The group 
of Mo over % is a p-group and solvable, &> is metacyclic over %. The field &% 
is completely ramified and is cyclic over Wp. Hence Rp is metacyclic over §, 
the group Gp of &o is solvable. The normal subfield & of &, corresponds to a 
subgroup of Gp, the group of over is = G/H which is homomorphic 
to Gp and hence is solvable. 


7. Finiteness of the number of fields 
If ¢ is the ramification order of a p-adic field § over 2, we define 
(49) y=ety, 
where y is the greatest integer in the rational fraction «(p — 1)’. We let 


See [16] in this connection. 
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a;, +++, @, range independently over 0, 1, , so that there are 
H gs principal units in the set of principal units of the form 
(50) V = 1+ + + --- + 


every principal unit of § is congruent modulo pj" to one of the units (50). 
Then we may prove 

Lemma 8. Every principal unit in § has the form U = UV, where Uy is a 
principal unit of § and V is one of the g§ quantities (50). 

For every principal unit has the form U = V + P3*'B where B is an integer 
of §. Then FE = UV = 1 + P}"D with D = BV™ an integer of $. The 
a congruence y” = E(mod pg*’) has the solution y = 1. By a known result” 

there exists an integer Up in § such that US = E. Now U? = 1(mod pj"), 
| (p) = ps, (Uo — 1)? = Ut = O(mod p§), Uo = 1(mod pg), Uo is a principal unit 
ih of §, U = VU? as desired. 


H We shall use the result above in a discussion of the finiteness” of the number 
of p-adic fields of a given degree. Note that every subfield of a normal field 
NM corresponds uniquely to a subgroup of its automorphism group. This group P 


i is a finite group, it has only a finite number of subgroups, Jt has only a finite 
TE ae number of subfields. This is evidently also true of any subfield of Jt and hence 
atta BA every field of finite degree over § has only a finite number of subfields. 


HH . We now let & be cyclic of degree p over a p-adic field §, ¢ be a primitive f 
| p root of unity, be the cyclic field §(¢). Then it is well known that the 
| abelian field % = &(¢) = %o(é) where & = ain Buta = fora 

H H principal unit U of % and Lemma 8 states that U = U¢V for V given by (50). ( 


We write \ = + r for 0 r < p, use fs, = for k = gg,p and have 
a = Then & = Fo(n) where » = n” = P%,V is one of a finite 
Bae number of quantities of Fy. Hence & is equivalent to a field contained as a 


subfield of one of a finite number of abelian fields over §. . 
The result just proved combined with Theorem 16 states that there are only a 
finitely many inequivalent cyclic fields of any prime degree p over any p-adic e 
field §. If & is normal of degree n over § Theorem 20 implies that of 
R= Ri > Ra >--- > Ki > Ko = F where RK; is cyclic of prime degree p; of 
over = 1,--- = pr, Reis equivalent to one of a finite number 
of inequivalent fields of degree p; over %;1. But then there are only a finite ee 
number of such fields & in the sense of equivalence. Our earlier argument then “ 
THEorREM 21. The number of inequivalent fields of a fixed degree over a p-adic . 
field & is finite. lat 
8. The Grunwald existence theorem the 

no 

The very important theorem of Grunwald on the existence of abelian fields , 
with prescribed local properties has never been stated, in the literature, In 4 “on 

18 See Theorem 1 of [29]. a 


19 See footnote 5. ry 
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form most convenient for applications to the theory of rational division algebras 
which motivated its discovery. We shall analyze the theorem here and obtain 
an equivalent form free of the terminology of class field theory and much more 
useful for the study of algebras. 

We let §* be the set of all non-zero quantities a of an algebraic number field 
}, G be a finite abelian group of order n, and ¢,, --- , ¢, a finite number of non- 
archimedean valuations of §. To each ¢; there corresponds a unique prime 
ideal p; of § defining ¢;. For each j we assume the existence of a function 
x(a) on §* to G such that x;(aia2) = x;(a1)x;(a2). We also assume that 
there exists a power, and hence a least power ©; , of p; such that if a = 1 (mod €,;) 
then x,(z) = J. The Grunwald theorem states” that then there exists a field 
R of degree n and automorphism group @ over §, such that the norm residue 
symbol 


(5 (=) = 
Pj 

for every a of §*. Moreover if § ranges over all real conjugates of § and n is 
even we may prescribe the corresponding conjugates ®” over § to be either 
real or imaginary as desired. 

Let now §,; be the derived field of § with respect to ¢;, 3; be any abelian 
field over §s; with automorphism group I;  cenieaeen to a subgroup G; of G. 
The norm-residue symbol 


is defined in the local class field theory as a function on the set Be, of all non- 
zero quantities of §»; toT;. If a is in §* we define x;(a) to be the element of 
§; corresponding to (52) under the given equivalence of T; and @;. Then it 
is known in the local class field theory that x,(a) has the properties in the 
hypothesis above of Grunwald’s theorem and we choose it to be the function 
of that theorem for j = 1, ,r. But 3; is uniquely determined in the sense 
of equivalence as the local ‘class-field corresponding to the class group H of all 
quantities a of e; such that (52) is the identity element of T;. Since the com- 
posite R; of R and §%,; corresponds to the same class group © as does 3; it is 
equivalent over §; to 3;. If ¢ is any archimedean valuation of § and 3 is 
4 field over §, such that the group of 3 over §, is equivalent to a subgroup of 
%, then either 3 = %, or G has even order and 3 is quadratic over §. The 
latter case can occur only when %, is real. Grunwald’s theorem then states 
that we may take the composite of & and %% to be equivalent to B,. We 
how state our result as 

THroreM 22. Let § be an algebraic number field, ¢:, --- ,¢, be any finite 
number of valuations of §, @ be an abelian group, 3; be a field over the derived 
field §4; of § with automorphism group equivalent to a subgroup of ®. Then there 


20 See [7]. 
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exists a field R with automorphism group © over § such that the composite of & 
and §4,; is equivalent over &; to 3;. 

That there exist infinitely many fields & follows from Grunwald’s proof, 
However we may better regard this result as a corollary of our theorem which 
we state as the 

Corotuary. There exist infinitely many fields R inequivalent over § and 
with the property of Theorem 22. 

For if t is any integer and p is any prime divisor of the order of the group 
( there exist non-archimedean valuations ¢,41, --- , 42 all distinct from the 
¢1,---, and fields unramified of degree p over Weapply Theo- 
rem 20 to obtain an abelian field &;, such that , 5s) = Wi, (Mk, = 
| 3s,., for? # k. Evidently the fields &, are inequivalent over § for distinct 
values of k. 
vs Our Theorem 22 indicates that the Grunwald theorem should be regarded 
merely as a result which allows us to obtain a field over § with a prescribed 
. abelian group and prescribed composites with a finite number of derived fields 
i of §. Its hypotheses should not contain any conditions on the existence of 
beara such composites but this should be left as a problem of the structure and exist- 
He ence of algebraic extensions of p-adic fields. 


9. Splitting fields of rational division algebras 


If D is any normal division algebra of degree n over an algebraic number 
Mid ‘| field 8 and 3 is any cyclic field over & then it is known” that 3 splits D if and 
Bi " only if the degree over &, of the composite of 3 and &, is a multiple of the index 

id of D X R, for every valuation @ of KR. Our form of the Grunwald theorem 
then implies 

THEOREM 23. Let D be a normal division algebra of degree’n over an algebraic 
number field of finite degree over §, and let , , be all the non-archimedean 
valuations of R for which the index of D XK Ky; ts m; > 1. Then there exists a 
cyclic field 3 of degree n over § whose composite with R splits D if and only if there 
exist cyclic fields 3; of degree v; over § such that v; divides n, the composite of 
3; and has degree a multiple of m; over Ry; ,j = 1, 

For, by Theorem 22, there exists a cyclic field 3 of degree n over § such that 
the composite of 3 and §%; is equivalent to 3; , the composite of 3 and §» 


is algebraically closed for every archimedean valuation ¢ of §. Let 3e be the 
composite of 3 and &. Evidently the degree of (3g, Rs) is a multiple of the i 
index of D X K, for every archimedean valuation ¢ and every non-archimedean ( 
¢ ~ ¢;. Now the degree of (3¢, &y;) = (3, Rs;) = (3;, is a multiple i 
of m; as desired. I 
Let us discuss some consequences of the condition of our theorem. We ob- I 
serve first that we have p 
THEOREM 24. Let D be a normal division algebra of degree n over tts centrum 
if 
T 


21 Cf. [4], Chapter IX, 


nit 
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Q which has degree q prime to n over an algebraic number field ®. Then there 


exists a cyclic field 3 of degree n over § such that the composite of 3 and & splits D. 


For we choose all the integers »; of Theorem 23 to have the value n. The 


degree of Kp, over Fe; is a divisor of g and prime to n, the composite (3; , &5,) = 
3; X Rg; has degree n over Ry; , m; divides v; as desired. 

We next prove 

THEOREM 25. Let D be a normal division algebra of degree n = 2v over a field 
R of degree 2u over an algebraic number field § and let u and v be relatively prime 
odd integers. Then there exists a cyclic field 3 of degree n over § such that the 
composite of 3 and & splits D. 

For if the maximal unramified subfield W; of &,; has odd degree over 5, 
we choose the cyclic field 3; of Theorem 23 to be unramified of degree n over 
; of the composite of 3; and &,; is their direct product and has degree n 
over Ry,. If, however, W; has even degree the composite 9); of W; and the 
34,(R;), Rj = P; the prime quantity of §%, , is the direct product W,(R,) of 
degree two over W;. The degree of Ry; over W; divides yw and is odd, the 
composite R;(R;) of Y; and Rs, has degree two over Ry;. We now let 3; 
be the composite of §»;(#;) and the unramified field of degree v over 4; , this 
latter field has degree over §; prime to the degree two of &4,(R;), the com- 
posite of 3; and Sy, has degree n over Ry;. Hence (3, &) splits D. 

The result above is a generalization of the case where D is a generalized qua- 
ternion division algebra over a quadratic extension of the field 9 of all rational 
numbers. As we indicated in our introduction it is this case that inspired the 
present article. 

The field 3 may not exist in the remaining types of degrees n and q, and we 
prove the following existence theorem. 

THEOREM 26. Let n and q be rational integers which are neither relatively 
prime nor the doubles of odd relatively prime integers. Then there exists a field 
R of degree q over the rational number field R and a normal division algebra of 
degree n over R such that D is not split by the composite of R and any cyclic field 
of degree n over R. 

For let p be a prime dividing both n and q, n = p’n for (m, p) = 1. Then 
if 3 is cyclic of degree n over ® the field (3, &) splits D if and only if (3, , ) 
splits D, , where 3, is the subfield of degree p’ over & of 3, and D, is the direct 
factor of D of degree p” over R. Hence we may assume, without loss of general- 
ity, that n = p” where p is a rational prime dividing g, and that if p = 2 then 
either » > 1 or q is divisible by four. We now define a rational prime p. If p 
is odd we let p be a prime of the form p = pt + 2so that p — 1 = pi + 1is 
prime to n. If p= 2 but »>1 we let p = be a prime, and 
have p — 1 = 2(2”"t + 1) not divisible by 4. In either case Theorem 8 im- 
Plies that the only cyclic field of degree n over §, is the unramified field. 

We shall construct an abelian field & of degree q over ® and use the notation 
» for a prime ideal divisor of p in &, ¢ for the corresponding valuation. Apply 
Theorem 23 to construct a field & which is cyclic of degree g over # and such 
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that Ry = (R, Rp) is unramified over ,. Then the composite of Ry and any 
cyclic field of degree n over 8, cannot have degree n over &4. 

Let next p = n = 2, q = 4r for an integer r._ We let p by any odd prime, 
R be an abelian field of degree g over R chosen so that Ry = (RK, R,) is the 
composite of an unramified field of degree 2r over ®, and the field %,(P), 
P* = p. Then &, contains a primitive (p’ — 1)" root of unity f, p? — 1 = 
(p — 1)(p + 1) = 2A(p — 1), (¢3)” = ¢ is the primitive (p — 1)" root of unity 
in Rp. Hence Ry contains ¢>P such that (OP)? = tp. By Theorem 3 Ry 
contains all quadratic fields over ®, . 

We have made the argument above to prove the existence of a field & of degree 
q over ® and a valuation ¢ of R such that the composite of &, and any cyclic 
field 3 of degree n over R cannot have degree n over Rs. By Theorem 22 
there exists a field 9) which is cyclic of degree n over R such that (9), R,) is 
unramified of degree n over R,. We may choose a quantity y in & which 
has order one with respect to ¢. Then D = (9, S, y) is a cyclic algebra such 
that D X RK, is a division algebra. Evidently (3, &,) cannot split D K &,, 
(3, R) cannot split D. 

We close our results with a proof of 

THEOREM 27. Let & be an algebraic number field of degree q over R, D be a 
normal division algebra of degree n over R. Then there exists a cyclic field 3 of 
degree nq over R such that the composite of 3 and & splits D. 

For let ¢1, --- , ¢- be the non-archimedean valuations of ® such that the 
index of D X Ky; ism; > 1. We let Ry; be the corresponding derived fields of 
R contained in Kz; and choose a cyclic field 3 of degree ng over R such that 
(3, Rs;) is unramified of degree nq over Ry; and the fields (3, My) are alge- 
braically closed for every archimedean valuation ¢ of 9. Then the degree of 
(3, &s;) is ngr;* where r; is the degree over Ry, of the intersection of Ry, and 
3. Hence r; divides g, ngr;' is a multiple of n and of m;. It follows that 
(3, X) splits D. 
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| POLYNOMIALS IN SEVERAL VARIABLES’ 
By Rurvus OLDENBURGER 
) (Received June 5, 1940) 
1. Introduction 
i Sylvester, Hilbert, Richmond, and Bronowski’ (I-IV) have concerned them- 
ey selves with certain very special problems on the expressability of forms of degree 
p as sums of p** powers of linear forms with coefficients in the complex field of 
numbers. The class of forms which can be written as such sums for fields 
other than the complex field is quite limited. 
It has been proved (V) that each quadratic form Q = = is 
i equivalent under non-singular linear transformations in a given field K to 
yi + Aye + --- + AY? where r is the rank of Q. This is equivalent to the 
i statement that Q can be written as a linear combination \,Li of squares of linearly 
, independent linear forms L, , Lz, --- , L, with coefficients in K. Recently the 
ny author determined (VI) when a form F of degree p can be written for a field K 
i | : as a linear combination of p** powers of linearly independent linear forms. The 
fk fil! 4 form F is then equivalent to a canonical form 


(1.1) Myt + + Ayr 


tt with diagonal matrix of coefficients. Such forms were termed non-singular with 
ol Sea respect to K. Not every form F of degree p is non-singular for a field K. We 
f Ea are thus led to consider conditions under which a form can be written for a 
ty | Wa field K as a linear combination of p‘” powers of linear forms. Each form which 
can be so written can be written as 


Ly = 1,2,---, a), 


where the matrix (a;;....) of coefficients is symmetric (that is, the element 

a;;.... is equal to each element obtained from a;;.... by permutation of the sub- | 
scripts). It is therefore to be understood throughout this paper that each form ' 
has a symmetric matrix of coefficients. This restriction is necessary since, for 
example, the form zy has no symmetric matrix of coefficients for a field with 
characteristic two. 


The basic result of the present paper is that a form F of degree p can be written 
as a linear combination of p powers of linear forms if the field K of coefficients ( 
is of order p or more. Call such a linear combination involving a minimum 
number of terms a minimal representation of F for K. Also call the number PF 

1 Presented to the American Mathematical Society, October 30, 1937, and April 13, 1940. P 

: * The reader is referred to papers I-IV listed at the end of this article. Throughout ; 

Th the present paper Roman numerals will be used to designate references in the bibliography. t 
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of terms in a minimal representation the minimal number of F for K, and denote 
it by m(F). Then the number m(F) is tnvariant under non-singular linear 
transformations on the variables of F, where the coefficients in these trans- 
formations are in K. We shall be concerned with the study of the range of the 
minimal number [in particular for the class of binary forms], the various minimal 
representations of a form, the minimal numbers of algebraic combinations of 
forms, and the equivalence of forms. 

Evidently the majority of the theorems in the present paper may be stated 
with slight modification for non-homogeneous polynomials. 


2. Definitions 


If a form F can be transformed by a non-singular linear transformation with 
coefficients in a field K to a form with r variables, and F cannot be so trans- 
formed into a form with less than r variables, F is said to be a form in r essential 
variables, and we say that “r of the variables of F are essential.”” We arrange 
the elements of the symmetric matrix (a;;....) of F in an ordinary 2-way matrix 
A; with 7 as row index and columns associated with j, --- ,k. A typical column 
of As is of the form 


Anj eee k 


It can be proved (cf. VII) that the rank of the matrix Az is equal to the number of 
essential variables in F. Hence the number of essential variables in a form F 
can be determined in a finite number of steps. In the case of a quadratic form F 
the minimal number of F is obviously identical with the rank and number of 
essential variables in F. 

If we can write a form F as 


(2.1) ML? + , 


with coefficients in a field K, we shall say that (2.1) is a o-representation of F 
with respect to K. 


3. Remarks on forms with symmetric matrices of coefficients 


Each form F of degree p in n variables has a symmetric matrix of coefficients 
if and only if the characteristic of the underlying field K does not divide the 
coefficients C(p; r, s,--- ,t) = p!/ris!.-. d! in the expansion of (a1 + 22 + 
‘++ +2,)". If F and G denote two forms of degree p, and F = G for all values 
of the variables in a field K, corresponding coefficients in F and G must be 
equal if and only if K is of order (p + 1) at least. It follows that a form F of 
degree p in n variables has a unique symmetric matrix of coefficients if and only 
if the field K is of order (p + 1) at least, and the characteristic does not divide the 
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integers C(p; r, s,---,0). If n > 2, this matrix is unique if and only if the 
characteristic of K is greater than p. If m = 1, the matrix of coefficients is a 
single element and is always unique. For a field for which the symmetric 
matrices of forms of degree p in n variables are unique, the symmetric matrices 
of forms of degree p in m, m < n, variables are also unique. In the problem of 
equivalence of forms we shall need these uniqueness notions. 


4. Representation of forms 


We let N denote the number of distinct terms in the expansion of 
(x, + a + --- +-2,)”. The existence of representations is established in the 


following theorem. 
THrorEM 4.1. For a field of order p or more the minimal number m(F) of a 
form F of degree p in n essential variables exists, and satisfies the inequalities 


ns mF) SN. 


That m(F) = n is obvious. We consider the general form F = 
+++ Lm, Where A = (4;;...m) is symmetric, the p indices 7, 7, --- , m 
range over 1, 2, --- , n, and the elements of A are indeterminates over K. Now 
F = ML? + + + robe , if 


(4.1) 


a=1 


where we have written Le = baiti + + Dantn. We may write 
(a1 + a2 +.---+ as a sum 


where the a; are integers, and the f; are distinct power products of degree p in 
11, %2,--+,2,. We let b; denote the set of elements (bi, bie, --- , bin) for 
each value of i in the set 1, 2,---, 0. The system of equations (4.1) with 
o = N is then equivalent to the set 


N 
(4.2) folba)re = (8 = 1,2,---,N), 
where yi, Y2, --* , Yyw are equal in some order to the independent elements of 
A. If we show that the b, can be chosen in K so that the determinant 
| Dy | = | fa(ba) | 


is not zero, there exist solutions for the \’s in (4.2), and hence an N-representa- 
tion of each form F of degree p in n essential variables. 

We suppose that the f’s are labeled so that f:(b:) = bf. Since fi(b:) is a non- 
zero polynomial we can choose 6b; so that f;(b:) # 0. The maximum possible 
rank of Dy is thus at least 1. We assume that the maximum possible rank of 
Dy is M < N. By rearranging the fg , if necessary, we may assume the exist- 
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ence of , bu such that | Du | | f(ba) | # 0, [a, 1, 2, M}, and 
that | Dusi| = | fe(ba) | = O la, B = 1, 2,---, M + 1] for this choice of b, , 
by, , bw and any buy:. Then the rows of | Dw | are linearly independent, 
and the row fu4i(b:), , is a linear combination of them. 
Hence there exist \; in K such that the function 
M 

= fusi(z) — p> Asfa(z), (x) = (a1, 22, +++, tn), 

vanishes for z = b1,2 = be, ---,2= by. If K is of order = (p + 1) the form 


f(x) is a non-zero polynomial. Thus there exists.a bay: such that. f(bw41) # 0 
in K. For this choice of by 1 we have 


Du 
O 


giving a contradiction. Hence M = N, and m(F) S N. 

If the field K is of order p, then p =r’, where r is a prime. Thus 
F = wat + mee? +--+ + , whence m(F) = 

If F is non-singular, m(F) attains the lower bound n. It is remarked that 
the terms in a minimal representation of F can be grouped to give F = 
Fi + Fo + Fy, where --- , are non-singular forms. 

It follows from the proof above that each form F of Theorem 4.1 has an 
N-representation involving the same linear forms, these representations differing 
only in the values of the 2’s. 

Corotuary 4.1. For a field K of order p or more, a forms F of degree pis a 
linear combination of p** powers of linear forms. 

The minimal number depends on the field K as the following example shows. 

We write Q = 62°y’. Now m(Q) = 3 if K contains an element w ¥ 1 such 
that o = 1, and the characteristic of K is not 2 or 3. If K does not contain 
such an w and has the same restrictions on the characteristic, Q has the minimal 
representation 


| | = | Du ¥ 0, 


whence m(Q) = 4. 


5. Remark on existence of representations 


The existence of representations of the form F of Theorem 4.1 can be proved 
more simply than in Section 4 by first showing that C(p, q)x” “y* has a (p + 1)- 
representation. Assuming that forms of degree p in less th n variables can 
be written as linear combinations of p‘ powers of linear forms, one can show 
that forms of degree p in n variables can be so written. The existence of 
representations now follows by induction. ‘This type of proof does not yield as 
much information about the minimal number as the proof given in Section 4, 
and is therefore omitted. 
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It can be readily shown that Corollary 4.1 cannot be strengthened to fields 
of order less than p. 

Corollary 4.1 implies that, for each integer r that divides p and field K of 
order p or more, F can be written for some value of o as 


MHi + + 
where the H; are forms of degree p/r. 
6. Uniqueness 
| The following theorem is valid for each field K for which the symmetric 


matrix of RF is unique. 
THEOREM 6.1. If 


(6.1) R = + + + 


f is minimal, the d’s are uniquely determined by the L’s. 
it Let F and L,, --- , L, be given as in Section 4. The condition that F and R 
be identical is given by (4.1). If the \’s were not unique, one or more would be 
Met iel i arbitrary, whence we could choose some of the \’s equal to zero. In a minimal 
iF | represeritation R the \’s are not zero, whence the )’s are unique. 

"it Hi It follows that the matrix M of coefficients of the \’s in (4.1) is of rank co. 


| | 7. Notation 

wi AE | If F has a representation (6.1), we shall say that F has a representation [X, B), 
where denotes the set A2, --- , Ae], and B is the matrix of coefficients of 
i a the L’s. The form F is non-singular if and only if B is non-singular. 

| The matrix B in a minimal o-representation [A, B] of a binary form F of 


wit ti But degree p, can always be written as 
B= DC, 
where 
0 1 1 Bi 
p2 1 Be 
D= ’ C= 1 Be or 
B 1 Bo-1 


and the #’s in C are distinct. Hence, F has a representation [u, C] where 
= , , dope]. It is no restriction in what follows to treat only 
minimal representations of binary forms where these representations are of the 


type [u, C]. 


8. Binary forms 


In this section we shall study the range of the minimal number for forms in 
two essential variables. 
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We assume throughout this section that the field K is such that each form con- 
sidered has a unique symmetric matrix of coefficients. 

Lemma 8.1. The minimal number of ry* is q + 1. 

Equating walz + Bayl*' [a = 1, 2, --- ,q + 1] to zy’ we obtain a system of 
(q + 2) non-homogeneous linear equations in the u’s which can be solved for 
the »’s if and only if there exists a solution for the y’s in the system of homoge- 
neous linear equations 


1 
(8.1) Bene = 0 ll, 


where the 6’s are distinct. The determinant of coefficients in (8.1) is a product 
of the differences between the 6’s by the sum 


Sout = Bi + Bo +--+ + Bou. 


It has been proved elsewhere [XII] that there exists a vanishing sum S,4; 
unless the characteristic of the field K is two, and (q + 1) = 2o0r (q+ 1) = 
n — 2, where n is the order of K. By assumption, if (¢ + 1) is even, the char- 
acteristic of K is not two. If (q¢ + 1) is odd, and the characteristic of K is two, 
the order of K is even or infinite, whence (¢q + 1) # n — 2. Thus in any case 
there exists a vanishing sum S,4; of distinct 6’s, and m(zy*) S q + 1. 

If we equate ay! to Bay)" [a 1, 2, ql, or to Ma(t + Bay)" + 
uy = 1, 2, --- ,q — 1], where the are distinct, we obtain inconsistent 
equations in the yp’s. By Section 7, m(ry*) = q + 1. 

THEOREM 8.1. For the class C of binary forms F of degree p the minimal number 
ranges over 2, 3,---,por2,3,---,p+1. 

Since we are dealing with forms in two essential variables m(F’) 2 2. 

By Lemma 8.1, we can write 


(8.2) ay” = AL? fi=1,2,---, ph. 
Since the p-representation in (8.2) is minimal any partial sum \,L? [¢ = 1, 
2,--- , r] of (8.2) is minimal. Hence m(F) takes on the values 2, 3, --- , p for 


forms Fin C. By Theorem 4.1 m(F) S p + 1, whence Theorem 8.1 is proved. 


9. Overlapping of ranges 


The ranges of the minimal number for classes of forms in n essential variables 
are related. 

THEOREM 9.1. The range of the minimal number for the class C’ of forms of 
degree q in n essential variables overlaps the range of the minimal number for the 
class C of forms of lower degree p in n essential variables provided the characteristic 
of the field does not divide p + 1, p + 2,---,@. 

To prove this we shall use the operation of differentiation defined in the 
usual way for polynomials without reference to limit processes. For a form F 
in C, we have m(F) = o, whereao =n +p. Now F = [a = 1,2, --- , al, 
Where the L’s are linear forms. Since for each a we have L, # 0, we can trans- 
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form F non-singularly to a form F’ = \.M@ in variables x; , 22, --+ , 2, , where 
the coefficient of x; in each M, is not zero. We can write F’ = X1.Q? , where 
the coefficient of x; in each linear form Q. is unity. Then Q. = x + N,, 


where JN, is a linear form in 22, 43, --:,2%,. We introduce the form 
(9.1) G = + [a = 1,2,... ,o}. 
Evidently, 

aG 


(p + 1)F 


By (9.1), we have m(G@) S oc. If m(G) = p < ao, 
G = la = 1,2,--- — 


whence 0G/dz; yields a (o — 1)-representation of Ff’. Thus m(G) = o. This 
proves the theorem. 


10. Minimal number of a sum of forms 


We shall need the following obvious lemma. 

Lemma 10.1. Let k be a non-zero element of a given field K. The minimal 
numbers of forms kF and F, with respect to K, are equal. 

THEOREM 10.1. The minimal numbers of forms F, G, and F + G for a field 
K satisfy the inequalities: 
(10.1) m(F) + m(@) m(F + GY = | m(P) — m@)|. 

We note that the existence of m(F) and m(G@): implies the existence of 
mF + @). The left.inequality in (10.1) is obvious. Label the forms F and G 


so that m(F) = m(@). By Lemma 10.1, we have m(—G) = m(G)._ The left 
inequality in (10.1) yields 


mF + G) rs m(—G) = m(F), 


whence the theorem is proved. 

There exist forms F and G for which the equality signs in (10.1) are attained. 
If F and Ginvolve m(F) and m(G) essential variables respectively, and the 
variables of F are aera of the variables of G, m(F + @) = m(F) + m(@). 
If G = —F, 


mF + G) = | m(F) — m(G)| = 0. 
Corouuary 10.1. Adding a term aL”, where L is linear, to a form F of degree p 


at most changes the minimal number by one. 


In the following theorem we give conditions under which a term aL” can be 
added to a form F to increase the minimal number of F. The field of coeffi- 
cients is arbitrary. 

THEOREM 10.2. Let F be a form that if [a = mini- 
mal representation of F, the expression [a = 1, 2, is also minimal. 
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If N is a linear form linearly independent of L,, Le, ---,L, and w # 0, the 
minimal number of F is increased by addition of uN” to F. 
A o-representation of F + »N” can be written as 


(10.2) F uN” AalLa + B.N]’, la 1, 2, 
where La = Gajt;(j = 1,2, --- , mJ, the variables 2, , x2, --- , x, being essential 
variables of F. Then A\.La [a = 1, 2, --- , o] is a minimal representation of F’. 


By assumption \.L2' is a minimal representation of a form G. We can write 
(Gf aS +++ 2m, Where the matrix (a;;...») is symmetric. We let M 
denote the 2-way matrix (@ei@a; -+- Gam) [a not summed] with a as column 
index, and (7, j,--- ,m) associated with the rows of M. By Section 6 the 
matrix M has rank o. Equating coefficients in (10.2) we obtain 


(10.3) Gam = [a = 1,2,--- , 


Since M has rank o, the only possible solutions of (10.3) are 
= = --- = = O. 


Since \.l2 [a = 1, 2, --- , a] is minimal, the )’s are not zero, whence 8, = 0 
foreach a. Setting the z’s equal to zero in (10.2), we obtain uN” = d.(8,N)” 
le = 1, 2,---,o], whence we have arrived at a contradiction. Thus 


m(F + uN’) = m(F) + 1. 
11. Minimal numbers of products of forms 


In a polynomial P,(8) = cB" + + --- + + we term ¢; the 
i" coefficient. If for a field K there exists a polynomial P,,(8) of degree n with 
distinct zeros and with 7 coefficient zero, we say that the field possesses the 
property P,,;. By Theorem 4.1 for any field K m[C(p + gq, p)x’y"] S 
p+q+ 1. A stronger result is stated in the following lemma where the 
field K is understood to be such that the symmetric matrix of x”y’ is unique. 

Lemma 11.1. The minimal number of F = xy’, p = 2, is less than 
(p + 1) of and only if the field K has the property Py¢,p - 

Equating F to 


R = pa(t + Bay)” [fa = 1,2,--- ,p+q] 


we obtain a system of linear equations in the y’s which can be solved for the 
v’s if and only if there exists a solution for the y’s in the system 


(11.1) =0 


where the ’s are distinct. The determinant of coefficients of the y’s in (11.1) 
1s a product of the differences between the 6’s by the coefficient c, in P»+,(8). 
Hence F has a representation R if and only if K has the property Pp44,p. 
It can be shown that F has a representation R’ + ppsy”"’, where R’ = 
Wat + Bay)’** [a = 1, 2,---,p + q — 1], and the are distinct, if and 
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only if 0F/dx has a representation dR’/dz. In view of Section 7 the lemma 
is now proved. 

Lemma 11.2. Let K be a field with characteristic not two, and order at least n. 
The field K possesses the property P,,; for cach odd integer 7. 

A pair (a, —a) of elements of K is called a 0-pair. The pair (0, 0) is termed 
the null 0-pair. 

We assume that 2 = 2k, where k is an integer. Since the characteristic of K 
is odd, the order of K is odd or infinite. There thus exist distinct non-null 
0-pairs (a; , —@1), (@2, --- , in K. In the expansion of 


(11.2) P(x) = (a + — a1) (x + — ae) + — ay) 


no terms of odd degree occur. 

We let m = 2k + 1, where k is an integer. We replace P(x) by xP(z) thus 
adjoining 0 to the a’s in (11.2), whence the lemma is proved. 

In the following theorem it is naturally assumed that the field K is such that 
I’, G and FG can be expressed as linear combinations of powers of linear forms. 

THEOREM 11.1. Let K be a field with characteristic not two and let F and G 
be forms of degrees p and q respectively. The minimal numbers of F, G, and FG 
with respect to K, satisfy the inequality: 


(11.3) m(FG) S (p + q)m(F)m(G). 


By Lemma 11.2, the field K possesses at least one of the properties P+,» 
or Py4¢-1,p-1 When p 2 2. By Lemma 8.1, and 11.1, 


mx’y*) Sp 


Since the product of a p-representation ;L? by a o-representation »;Mj is a 
sum of po terms of the form \ju;L? M$ (t, 7 not summed), the theorem is proved. 
By Lemma 8.1 the equality sign in (11.3) may be attained. 
We are now in a position to generalize a well-known theorem on the ranks 
of reducible quadratic forms. 
THeoreEM 11.2. If F is a form of degree p which splits into linear factors with 
coefficients in K, the minimal number of F with respect to K is not greater than p!. 
By Theorem 11.1, if F = GL, where L is linear, m(F) S pm(G). Repeated 
application of this principle yields the proof of the theorem. 


12. Equivalence of forms 


In what follows “equivalent” will be understood to mean ‘equivalent under 
non-singular linear transformations in the given field.” It will be assumed 
throughout this section and sections 13 and 14 that the field is such that the 
forms have unique symmetric matrices. 

A minimal representation of a form F in n essential variables is always of 
the type 


a=n+1 
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where the L’s are linearly independent. The d’s and the c’s in (12.1) form a 
set of elements S. The sets S for a given form F form a class C of sets which 
we shall term the minimal class of F with respect to K. 

THEOREM 12.1. T'wo forms F and F’ are equivalent for a field K, if and only 
if the minimal classes of F and F’ with respect to K are identical. 

A non-singular transformation on the variables of F can be represented by 
the equalities 


(12.2) L; = R; {i = 1, 2,--- , nj, 


where the R’s are linear forms in the new variables. Under a transformation 
(12.2), the minimal representation (12.1) goes into a minimal representation 
with the same set S of coefficients. 

THEOREM 12.2. Let [A, A] be a minimal representation of a form F. For a 
form F’ to be equivalent to F it is necessary that F have a minimal representation 
(\, A’] for some matrix A’ and the same d’s. If this condition is satisfied, F and F’ 
are equivalent if and only if there exist non-singular matrices U, X such that 
(12.3) A’ = UAX, 
where [\, UA] ts a representation of F. 

If we apply a transformation z; = b;;y; to a form F with representation 
[\, A] we obtain a form F’ with representation [A, AB], where B is the matrix 
(bi;). Sinee the variables in F are essential, the rank of A is the number of 
columns of A. Let A’ be a matrix of the same dimensions and rank as A. 
There exists a non-singular matrix U such that A’ = UA. Theorem 12.2 is 
now proved. 

The problem of equivalence of forms thus divides into the following parts: 

1. The determination of all minimal representations [A, UA] of a form with 

representation [A, A]. 
2. The solution of (12.3) for a non-singular matrix X. 


13. Remark on the equivalence of forms 
Let F and F’ be forms of degree p in n variables with symmetric matrices 
A = (4i;...4m) and A’ = (aj;...1s) respectively. We arrange the elements of A 
in a 2-way matrix A» with m as column index so that a typical row of Ag is 
In the same way we arrange the elements of A’ in a 2-way matrix A; with f 
as column index so that a typical row of A is 


The forms F and F’ are equivalent if and only if there exists a non-singular 
matrix x such that 


where ty is the direct product of (p — 1) matrices identical with zx. The 
matrix 2p is non-singular if z is non-singular, a fact upon which much of the 


| 
: 
ms. 
FG 
i 
| 
| 
f 
ler 
he 
of 
(13.1) | 


4 
if 


i 


704 RUFUS OLDENBURGER 


“rank” theory of the author developed elsewhere (VII) is based. Equation 
(13.1) is in appearance much like (12.3), but (12.3) is stronger than (13.1) in 
that the 2-way matrix A of (12.3) has in general less rows than Ag of (13.1) 
and the matrix U is often more restricted than 2p . 


14. Relations between minimal representations 


In the present section we determine the matrix U of (12.3) for non-singular 
forms. Two sums (a = 1, 2,---,¢) and (a = 1, 2,--- are 
said to be simply related if there exist elements 1 , v2, --- , ¥» in the given field, 
and an ordering of the M, so that 


Ma = velba, bala = [a not summed]. 


Evidently, simply related representations are representations of the same form. 
We shall express this in terms of matrices. We let J denote a permutation 
matrix (obtained from the identity matrix J by permuting rows of J). For 
each r there exists a c(r) such that the non-vanishing element in the r*" row of J 


occurs in the c(r) column. We choose we, ANA Vj ,c(1) , V2,c(2) , 
Ve,c(e) different from zero in the given field K so that 

(14.1) Ae(r) = 

We write 


V1,c(1) 0 QO .-. 0 
O O 0 


0 0 0 . -0 Vz,c(0) 


The representation [u, DJA] is simply related to [A, A], and each representation 
simply related to [A, A] is of this form. 

THEOREM 14.1. Let F be a non-singular form of degree at least three. The 
minimal representations of F are simply related. 

We let [A, A] and [u, B] be minimal representations of F. Since F is non- 
singular, A and B are non-singular. We write A = (a;;), B = (b;;) and take 
these to be of order n, whence F involves n essential variables. Equating 
minimal representations of F we obtain 


(14.2) +++ +++ Lm = +++ +++ Im, 


where the indices range over 1, 2,--- ,n. We make the transformation 2; = 
Cisys , Where (c,.) is the inverse of A, whence (14.2) yields 


(14.3) Addai eee SamYi Ym = ++: Ym; 


where (6ai) = --» = (Sam) = I, I being the Kronecker delta, and (vai) = BA”. 
We denote (vai) by A. We can write (14.3) as 


(14.4) ANabai SamYi Ym = Mabap SarYpi UrmYi 
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where (5a) = = (bar) = I. We write 


(14.5) Zp = Upi¥i 
whence (14.4) is an equality between the forms 


P= G = pati. 
a= a=1 


That is, the form G@ can be obtained from F by the non-singular linear trans- 
formation (14.5). The transformation (14.5) is then such as to bring a 
form G with diagonal matrix into a form with diagonal matrix. By a theorem 
of another paper [VI] the only transformations (14.5) with this property are 
those for which A = DJ, where D is diagonal, and J is a permutation matrix. 
We thus have a c(r) for each r so that va; ¥ 0 for [a, 7] = [r, e(r)], and zero 
otherwise. We can now write (14.4) as 


, Yn, We may equate coefficients, 
DJA and [y, B] is 


Since (14.6) is an identity in ye, 
whence we obtain (14.1). Since B = AA, we have B = 
simply related to [A, A], whence the theorem is proved. 

It can be proved that there exist forms with minima! representations not 
simply related. Thus, for example, not all minimal representations of x*y are 
simply related. 


15. Multilinear forms 


The proofs of the theorems in this section are in general similar to the proofs 
above for ordinary forms and will be omitted. The theorems of this section 
are analogues of theorems proved above. The given field of numbers is 
arbitrary. 

THEOREM 15.1. Let M = djj...mtiYj 


Zm be a multilinear form. The form 
M can be written as ‘ 


(15.1) -++ Na, 
a=1 
where the L’s, W’s, --- , N’s are respectively linear forms in the x’s, y’x, +--+ ,2’8 
only. 
We designate the matrices of the sets of linear forms [La], [Wal, --- , [Na] 
by A, B, --- ,@ respectively. If M is written as (15.1), we say that M has a 


representation [A, B,--- ,G]. We write D = (dj;...m). 

We term the minimum value of o for which M can be written in the form 
(15.1) the factorization rank of M (and D) with respect to the given field K. 
That the factorization rank of a form M depends on the field K can be proved 
by considering binary trilinear forms (VI). 
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We arrange the elements of the matrix D of M in a 2-way matrix D, with ; 
as row index so that a typical column of D2 has the shape 


We term the rank of Ds the t-rank of M. It is equal to the number of linearly 
independent forms in the set [Z.] of (15.1). 

THEOREM 15.2. Let the maximum of the 7, j, --- , m-ranks of the multilinear 
forms M of Theorem 15.1 be n. Let the degree of M be p. The factorization rank 
o of M satisfies the inequalities 


If o in (15.1) is the factorization rank of M the representation (15.1) of M 
is termed minimal. 

THEOREM 15.3. If [A, B,---,F, G] and [A, B,---,F, G’] are minimal 
representations of a multilinear form M, then G = G’. 

We term the z-variables in M essential if M cannot be brought by a non- 
singular linear transformation 


Li = 


into a form with less z’-variables. If the x’s in M are essential the 7-rank of M 
equals the number of 2’s. 

THreorEM 15.4. Let [A, B,--- ,G@] and [A’, B’, --- , G’] be minimal repre- 
sentations of multilinear forms M and M' respectively, where all of the variables 
in M and M’ are essential. The forms M and M' are equivalent if and only 
there exist non-singular matrices X, --- , U, Q, --- , S such that 


XAQ = A’,--- , UGS = G@’, 


where [XA, --- , UG] is a representation of M. 
We have used “equivalent” in the above theorem in the usual sense that M’ 
can be obtained from M by the independent non-singular linear transformations 


, 
Li = , Yi = = 
Two representations [A, B, -.- , F, G] and [A’, B’, --- , F’, @’] of a multi- 
linear form are said to be simply related if 
A’'=JUA, JVB,...,F' = JWF, 0 


where J is a permutation matrix, and U,-.-- ,W are non-singular diagonal 
matrices. 

TuEorEM 15.5. Let M be a multilinear form of degree p, p = 4, given as in 
Theorem 15.1, with all variables essential. Let the i, j, -.- , m-ranks of M be al 
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least two. Let M have a representation |A, B, .-- , F, where B, --- ,@ are 
non-singular. The minimal representations of M are simply related. 

A multilinear form M is said to be non-singular if M has a representation 
(A, B,--- , F, G], where A, B, --- , F, G are ordinary non-singular matrices. 

Theorem 15.5 implies the following corollary. 

CoroLLaRY 15.1. Let M be a non-singular multilinear form of degree at least 
four. The minimal representations of M are simply related. 

That minimal representations of a multilinear form are not always simply 
related, and further that Theorem 15.5 cannot be strengthened to include tri- 
linear forms is a consequence of the following example. 

Let J denote the identity matrix of order three. We let B = G = J, and 


2 0 1 0 1 1 0| 
A= ||2 Oj}, 1 O}. 
0 1 00 1 


For the field Ks , composed of the distinct elements 0, 1, 2, with characteristic 3, 
(A, B, G], and [Q, R, 7'] are minimal representations of the same trilinear form. 
These representations are not simply related. 


16. Relations between the factorization rank and minimal number 


We let F denote a form with symmetric matrix of coefficients. By applying 
the well-known polarization process to F we obtain, except for a constant factor, 
the multilinear form My = 4ijj...miYj +++ %m. We shall say that M, is the 
multilinear form associated with F. The factorization rank of My with respect 
toa field K is termed the factorization rank of F with respect to K. We denote 
the factorization rank of F by f(F). 

We let [F] denote the class of forms F of degree p in n essential variables. 
We let K be such that the symmetric matrices of the forms in [F'] are unique. 
The minimum value of the factorization ranks of the forms F in [F] is n.  Else- 
where [VI] the author proved that a form F is equivalent to a form 
Ma? + + + An O] if and only if the associated 
multilinear form is equivalent to +--+ + +--+ Zn. We thus 
have the following result. 

TaroreM 16.1. Jf the minimal number of F in [F] is a minimum for [F], the 
factorization rank of F is a minimum for [F], and these minimum values are equal; 
and conversely. 

For the case of quadratic forms the minimal number always takes on its 
minimum value. 

is non-singular in the sense of another paper [X] of the author. Conditions for 
non-singularity of a multilinear form given in that paper may thus be applied 
to dete..aine whether or not F is non-singular. 

Elsewhere [VI] we obtained directly conditions for the non-singularity of a 
form F. These conditions are expressed in terms of generalized determinants 
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involving variable elements, and in terms of the sub-forms F,, F,, --- , F,, of 
F = ae 

We define the factorization rank f(A) and minimal number m(A) of a sym- 
metric matrix A = (d;;...m) to be the smallest ¢ for which 


A= (x don) or (2 ban) 
a=1 a= 


respectively. That the factorization rank, and minimal number are not always 
equal, at least in the matrix case, follows by considering an example. 

We let A be the 6-way matrix A = (ijxym) of order 2 all of whose elements 
vanish except those which have exactly one subscript equal to 1. The non- 
vanishing elements in A are assumed to be equal to 1. If 2,3 ¥ 0, A is the 
matrix of 6zy’. We let K be the field of order 8 and characteristic 2. For K, 
the matrix A has a representation 


6 
( Daidaj Dak Dat das dam) 


with by, = be = --- = be = 1, and die, bee, --- , beg distinct. Hence f(A) < 7. 
Since there is no vanishing [XII] sum of 6 distinct elements in K, it follows 
that m(A) > 6. Hence m(A) # f(A). 


17. Note on non-singular forms 


In this section the field K is assumed to be such that the forms which occur 
have unique symmetric matrices. We prove here that if we write a non- 
singular form F as 2x,F; there exist values p; of z; so that p;F’; is non-singular. 
This property simplifies the determination of the non-singularity of a given 
form. We shall consider first the analogue for multilinear forms, from which 
the results for ordinary forms are readily derived. 

Lemma 17.1. Let a multilinear form G = 2G; in essential variables 2; , 
Yi, +++ ,%m be of degree at least 3, and let G have a representation [A, B, --- , D| 
where B, .-- ,D are non-singular. There exist numbers p; so that pi; 1s non- 
singular. 

Since B, ..- , D are non-singular, the indices j, --- , m have a common range 
1, 2,---,m, whereas the range 1, 2, .-- ,m of i may be different from this. 
We note that G can be reduced under non-singular linear transformations to a 
form G’ with representation [A, I, --- , I], I being the identity matrix of order 1. 
The reduction is performed by operating on the vector 'spaces (yi), «++» @m) 
with transformations whose matrices are .-- , respectively. 

We write G = Evidently G’ = = 1, 2,---,m; 
t = 1,2,---,n], where A = (ay). Thus = aupy:--- 2. If we write 
L, = the form is simply 
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If L; = 0, we have a; = 0 for each 7, whence G’ is free of the variables 
est: ,2, contrary to assumption that the variables of G (hence G’) are essen- 
tial. Hence we can choose the p’s so that (17.1) is non-singular. Applying 
the transformations y: = bijy;, --- , 21 = dimen to G’ to return to G, the form N 
transforms covariantly into a non-singular form p;G; . 

TurorEeM 17.1. Let F be a non-singular form of degree at least 3. There exists 
a non-singular linear combination of the sub-forms of F. 

We write F as Qj;...m%i%j +++ tm, Whence Mp = We intro- 
duce the notations FP; = @jj...m%j 2m, Mi = Gij;...mYj 2m. By Theorem 
16.1, the form My, is non-singular. By Lemma 17.1 there exist numbers p; 
such that p:M; is non-singular. Since piM; = M,,r,, Theorem 16.1 implies 
that p;/’; is non-singular when p;M; is non-singular. The proof of Theorem 17.1 
is completed by applying Lemma 17.1 to M,. 

It can be shown that a transformation on 2, 22, --- , 2, that brings a form 


Mati + Ages + Ae, An ¥ O], p = 3, 


with diagonal matrix into a form with diagonal matrix brings any form in 
1,22, +++ , 2%, With diagonal matrix into a like form. From this and Theorem 
17.1a form F = 2,F; of degree p, p = 4, in n variables is non-singular if and 
only af: 

a) There exist values p; of x; so that p,F; is non-singular. 

b) If (a) is satisfied let pF; be chosen non-singular and reduce pf’; by a 
non-singular linear transformation 7’ to a form with diagonal matrix. Let the 
range of « be taken so that p, ~ 0. Let the transformation 7 bring 
F,,---,F, into --- , F, respectively. Then F;, --- , F, are forms with 
diagonal matrices. 

The present paper suggests various problems whose complete solution is still 
forthcoming. Among these are the determination of the exact range of the 
minimal number, the relation between minimal representations, and necessary 
and sufficient conditions that a representation be minimal. Applications of the 
theory to classical problems of algebra will appear elsewhere. 
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